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BAKER BOTTS L.L.P. 
30 ROCKEFELLER PLAZA 
NEW YORK, NEW YORK 10112 



TO ALL WHOM IT MAY CONCERN: 

Be it known that I, Margarita GARCIA-CALVO, have invented an improvement 

in 

NPC1L1 (NPC3) AND METHODS OF IDENTIFYING 
LIGANDS THEREOF 

of which the following is a 

SPECIFICATION 
FIELD OF THE INVENTION 
[0001] The present invention includes NPC1L1 polypeptides and polynucleotides which 
encode the polypeptides, methods of use and methods of identifying modulators and ligands 
thereof. 

BACKGROUND OF THE INVENTION 

[0002] A factor leading to development of vascular disease, a leading cause of death in 
industrialized nations, is elevated serum cholesterol. It is estimated that 19% of Americans 
between the ages of 20 and 74 years of age have high serum cholesterol. The most prevalent 
form of vascular disease is arteriosclerosis, a condition associated with the thickening and 
hardening of the arterial wall. Arteriosclerosis of the large vessels is referred to as 
atherosclerosis. Atherosclerosis is the predominant underlying factor in vascular disorders such 
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as coronary artery disease, aortic aneurysm, arterial disease of the lower extremities and 
cerebrovascular disease. 

[0003] Cholesteryl esters are a major component of atherosclerotic lesions and the major 
storage form of cholesterol in arterial wall cells. Formation of cholesteryl esters is also a step in 
the intestinal absorption of dietary cholesterol. Thus, inhibition of cholesteryl ester formation 
and reduction of serum cholesterol can inhibit the progression of atherosclerotic lesion 
formation, decrease the accumulation of cholesteryl esters in the arterial wall, and block the 
intestinal absorption of dietary cholesterol. 

[0004] The regulation of whole-body cholesterol homeostasis in mammals and animals 
involves the regulation of intestinal cholesterol absorption, cellular cholesterol trafficking, 
dietary cholesterol and modulation of cholesterol biosynthesis, bile acid biosynthesis, steroid 
biosynthesis and the catabolism of the cholesterol-containing plasma lipoproteins. Regulation of 
intestinal cholesterol absorption has proven to be an effective means by which to regulate serum 
cholesterol levels. For example, a cholesterol absorption inhibitor, ezetimibe ( 



), has been shown to be effective in this regard. A pharmaceutical composition containing 
ezetimibe is commercially available from Merck/Schering-Plough Pharmaceuticals, Inc. under 




OH. 



F 



F 
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the tradename Zetia®. Identification of a gene target through which ezetimibe acts is important 
to understanding the process of cholesterol absorption and to the development of other, novel 
absorption inhibitors. The present invention addresses this need by providing a rat and a mouse 
homologue of human NPC1L1 (also known as NPC3; Genbank Accession No. AF192522; 
Davies, et al, (2000) Genomics 65(2): 137-45 and Ioannou, (2000) Mol. Genet. Metab.71(l-2): 
175-81), an ezetimibe target. 

[0005] NPC1L1 is an N-glycosylated protein comprising a YQRL (SEQ ID NO: 38) motif 
(i.e., a trans-golgi network to plasma membrane transport signal; see Bos, et al, (1993) EMBO 
J. 12: 2219-2228; Humphrey, etal, (1993) J. Cell. Biol. 120: 1123-1135; Ponnambalam, etal, 
(1994) J. Cell. Biol. 125: 253-268 and Rothman, et al, (1996) Science 272: 227-234) which 
exhibits limited tissue distribution and gastrointestinal abundance. Also, the human NPC1L1 
promoter includes a Sterol Regulated Element Binding Protein 1 (SREBP1) binding consensus 
sequence (Athanikar, et al, (1998) Proc. Natl. Acad. Sci. USA 95: 4935-4940; Ericsson, et al, 
(1996) Proc. Natl. Acad. Sci. USA 93: 945-950; Metherall, et al, (1989) J. Biol. Chem. 264: 
15634-15641; Smith, et al, (1990) J. Biol. Chem. 265: 2306-2310; Bennett, et al, (1999) J. Biol. 
Chem. 274: 13025-13032 and Brown, et al, (1997) Cell 89: 331-340). NPC1L1 has 42% amino 
acid sequence homology to human NPC1 (Genbank Accession No. AF002020), a receptor 
responsible for Niemann-Pick CI disease (Carstea, et al, (1997) Science 277: 228-231). 
Niemann-Pick CI disease is a rare genetic disorder in humans which results in accumulation of 
low density lipoprotein (LDL)-derived unesterified cholesterol in lysosomes (Pentchev, et al, 
(1994) Biochim. Biophys. Acta. 1225: 235-243 and Vanier, etal, (1991) Biochim. Biophys. 
Acta. 1096: 328-337). In addition, cholesterol accumulates in the trans-golgi network of npcl" 
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cells, and relocation of cholesterol, to and from the plasma membrane, is delayed. NPC1 and 
NPC1L1 each possess 13 transmembrane spanning segments as well as a sterol-sensing domain 
(SSD). Several other proteins, including HMG-CoA Reductase (HMG-R), Patched (PTC) and 
Sterol Regulatory Element Binding Protein Cleavage-Activation Protein (SCAP), include an 
SSD which is involved in sensing cholesterol levels possibly by a mechanism which involves 
direct cholesterol binding (Gil, et ah, (1985) Cell 41: 249-258; Kumagai, et ah, (1995) J. Biol. 
Chem. 270: 19107-19113 and Hua, etal, (1996) Cell 87: 415-426). 

SUMMARY OF THE INVENTION 
[0006] The present invention is based on the discovery that NPC1L1 is the target through which 

ezetimibe act, and consequently plays a critical role in the regulation of sterol and 5oc-stanol 
intestinal transport and absorption, e.g. cholesterol absorption. Accordingly, this invention 
provides for the use of NPC1L1 in an assay for identifying ligands that block NPC1 LI -mediated 
sterol and 5cc-stanol intestinal transport. The present invention provides methods for identifying 
ligands of NPC1L1 which involve contacting NPC1L1 with a detectably labeled substituted 2- 
azetidinone, preferably substituted 2-azetidinone-glucuronide, and a candidate compound, and 
determining whether the candidate compound binds to NPC1L1 . The modulation of the binding 
of the substituted 2-azetidinone to NPC1L1 by the binding of the candidate compound to 
NPC1L1 indicates that the candidate compound is a ligand that binds to NPC1L1 and is an 
inhibitor of sterol and 5cc-stanol absorption. 

[0007] The present invention also provides a method for identifying a ligand of NPC1L1 
comprising contacting NPC1L1 with a detectably labeled substituted 2-azetidinone, preferably 
substituted 2-azetidinone-glucuronide, and measuring the binding of detectably labeled 
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substituted 2-azetidinone to NPC1L1 in the presence and absence of a candidate compound, 
wherein decreased binding of the detectably labeled substituted 2-azetidinone to the NPC1L1 in 
the presence of the candidate compound indicates that said candidate compound is a ligand of 
NPC1L1 and is an inhibitor of sterol and 5ct-stanol absorption. 

[0008] The present invention also provides for a method for identifying a compound that inhibits 
intestinal sterol or 5a-stanol absorption mediated by NPC1L1 involving contacting NPC1L1 
with a detectably labeled ligand and the candidate compound and determining whether the 
candidate compound binds to NPC1L1, wherein binding of said candidate compound to NPC1L1 
modulates binding of said ligand to NPC1 LI, wherein said modulation indicates that the 
candidate compound is an intestinal sterol or 5a-stanol absorption inhibitor. 

[0009] The present invention provides methods for identifying an ligand of NPC1L1 
comprising (a) contacting a host cell (e.g., Chinese hamster ovary (CHO) cell, a J774 cell, a 
macrophage cell or a Caco2 cell) expressing a polypeptide comprising the amino acid sequence 
of SEQ ID NO: 2 or SEQ ID NO: 4 or SEQ ID NO: 12 or a functional fragment thereof on a cell 
surface, in the presence of a known amount of a detectably labeled (e.g., with 3 H, 14 C, 125 1, 35 S or 
fluorescence labeling) substituted azetidinone (e.g., ezetimibe), with a sample to be tested for the 
presence of an NPC1L1 ligand; and (b) measuring the amount of detectably labeled substituted 
azetidinone (e.g., ezetimibe) specifically bound to the polypeptide; wherein an NPC1L1 ligand in 
the sample is identified by measuring substantially reduced binding of the detectably labeled 
substituted azetidinone (e.g., ezetimibe) to the polypeptide, compared to what would be 
measured in the absence of such a ligand. 
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[0010] Another method for identifying an ligand of NPC1L1 is also provided. The method 
comprises (a) placing, in an aqueous suspension, a plurality of support particles, impregnated 
with a fluorescer (e.g., yttrium silicate, yttrium oxide, diphenyloxazole and polyvinyltoluene), to 
which a host cell (e.g., Chinese hamster ovary (CHO) cell, a J774 cell, a macrophage cell or a 
Caco2 cell) expressing a polypeptide comprising the amino acid sequence of SEQ ID NO: 2 or 
SEQ ID NO: 4 or SEQ ID NO: 12 or a functional fragment thereof on a cell surface are attached; 
(b) adding, to the suspension, a radiolabeled (e.g., with 3 H, 14 C or 125 I) substituted azetidinone 
(e.g., ezetimibe) and a sample to be tested for the presence of a ligand, wherein the radiolabel 
emits radiation energy capable of activating the fluorescer upon the binding of the substituted 
azetidinone (e.g., ezetimibe) to the polypeptide to produce light energy, whereas radiolabeled 
substituted azetidinone (e.g., ezetimibe) that does not bind to the polypeptide is, generally, too 
far removed from the support particles to enable the radioactive energy to activate the fluorescer; 
and (c) measuring the light energy emitted by the fluorescer in the suspension; wherein an 
NPC1L1 ligand in the sample is identified by measuring substantially reduced light energy 
emission, compared to what would be measured in the absence of such a ligand. 

[0011] Also provided is a method for identifying a ligand of NPC1L1 comprising (a) 
contacting a host cell (e.g., Chinese hamster ovary (CHO) cell, a J774 cell, a macrophage cell or 
a Caco2 cell) expressing a polypeptide comprising an amino acid sequence of SEQ ID NO: 2 or 
SEQ ID NO: 4 or SEQ ID NO: 12 or a functional fragment thereof on a cell surface with 
detectably labeled (e.g., with 3 H, 14 C or l25 I) sterol (e.g., cholesterol) or 5cc-stanol and with a 
sample to be tested for the presence of an ligand; and (b) measuring the amount of detectably 
labeled sterol (e.g., cholesterol) or 5oc-stanol in the cell; wherein an NPC1L1 antagonist in the 
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sample is identified by measuring substantially reduced detectably labeled sterol (e.g., 
cholesterol) or 5oc-stanol within the host cell, compared to what would be measured in the 
absence of such an antagonist and wherein an NPC1L1 agonist in the sample is identified by 
measuring substantially increased detectably labeled sterol (e.g., cholesterol) or 5cc-stanol within 
the host cell, compared to what would be measured in the absence of such an agonist. 

[0012] The present invention includes methods for inhibiting NPC1 LI -mediated intestinal 
sterol (e.g., cholesterol) or 5a-stanol uptake, in a subject, by administering a substance identified 
by the screening methods described herein to the subject. Such substances include compounds 
such as small molecule antagonists of NPC1L1 other than ezetimibe. Also contemplated are 
methods for antagonizing NPC1 LI -mediated sterol (e.g., cholesterol) or 5ct-stanol absorption by 
administering anti-NPClLl antibodies. NPC1 LI -mediated absorption of sterol (e.g., 
cholesterol) or 5oc-stanol can also be antagonized by any method which reduces expression of 
NPC1L1 in an organism. For example, NPC1L1 expression can be reduced by introduction of 
anti-sense NPC1L1 mRNA into a cell of an organism or by genetic mutation of the NPC1L1 
gene in an organism (e.g., by complete knockout, disruption, truncation or by introduction of one 
or more point mutations). 

[0013] Also included in the present invention is a mutant transgenic mammal (e.g., mouse, 
rat, dog, rabbit, pig, guinea pig, cat, horse), preferably a mouse comprising a homozygous or 
heterozygous mutation (e.g., disruption, truncation, one or more point mutations, knock out) of 
endogenous, chromosomal NPC1L1 wherein, preferably, the mouse does not produce any 
functional NPC1L1 protein. Preferably, the mutant mouse, lacking functional NPC1L1, exhibits 
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a reduced level of intestinal sterol (e.g., cholesterol) or 5a-stanol absorption and/or a reduced 
level of serum sterol (e.g., cholesterol) or 5a-stanol and/or a reduced level of liver sterol (e.g., 
cholesterol) or 5a-stanol as compared to that of a non-mutant mouse comprising functional 
NPC1L1. Preferably, in the mutant mouse chromosome, the region of NPC1L1 (SEQ ID NO: 
45) deleted is from nucleotide 790 to nucleotide 998. In one embodiment, NPC1L1 (SEQ ID 
NO: 1 1) is deleted from nucleotide 767 to nucleotide 975. Any offspring or progeny of a parent 
NPC1L1 mutant mouse (i.e., npclll) of the invention which has inherited an npclll mutant 
allele is also part of the present invention. 

[0014] The scope of the present invention also includes a method for screening a sample for 
an intestinal sterol (e.g., cholesterol) or 5a-stanol absorption antagonist comprising (a) feeding a 
sterol (e.g., cholesterol) or 5oc-stanol-containing substance (e.g., comprising radiolabeled 
cholesterol, such as 14 C-cholesterol or 3 H-cholesterol) to a first and second mouse comprising a 
functional NPC1L1 gene and to a third, mutant mouse lacking a functional NPC1L1; (b) 
administering the sample to the first mouse comprising a functional NPC1L1 but not to the 
second mouse; (c) measuring the amount of sterol (e.g., cholesterol) or 5a-stanol absorption in 
the intestine of said first, second and third mouse (e.g., by measuring serum cholesterol); and (d) 
comparing the levels of intestinal sterol (e.g., cholesterol) or 5cc-stanol absorption in each mouse; 
wherein the sample is determined to contain the intestinal sterol (e.g., cholesterol) or 5a-stanol 
absorption antagonist when the level of intestinal sterol (e.g., cholesterol) or 5oc-stanol 
absorption in the first mouse and third mouse are less than the amount of intestinal sterol 
(e.g., cholesterol) or 5a-stanol absorption in the second mouse. 
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[0015] The present invention also encompasses a kit comprising (a) a substituted azetidinone 
(e.g., ezetimibe) in a pharmaceutical dosage form (e.g., a pill or tablet comprising 10 mg 
substituted azetidinone (e.g., ezetimibe)); and (b) information, for example in the form of an 
insert, indicating that NPC1L1 is a target of ezetimibe. The kit may also include simvastatin in 
a pharmaceutical dosage form (e.g., a pill or tablet comprising 5 mg, 10 mg, 20 mg, 40 mg or 80 
mg simvastatin). The simvastatin in pharmaceutical dosage form and the ezetimibe in 
pharmaceutical dosage form can be associated in a single pill or tablet or in separate pills or 
tablets. 

[0016] The present invention also provides any isolated mammalian cell (e.g. , isolated mouse 
cell, isolated rat cell or isolated human cell) which lacks a gene which encodes or can produce a 
functional NPC1L1 polypeptide. The isolated cell can be isolated from a mutant mouse 
comprising a homozygous mutation of endogenous, chromosomal NPC1L1 wherein the mouse 
does not produce any functional NPC1L1 protein. Further, the mutation can be in a gene which 
when un-mutated encodes an amino acid sequence of SEQ ED NO: 12 (e.g., comprising a 
nucleotide sequence of SEQ ID NO: 11). The cell can be isolated or derived from duodenum, 
gall bladder, liver, small intestine or stomach tissue. The cell can be an enterocyte. 



BRIEF DESCRIPTION OF THE FIGURES 
[0017] Figure 1A shows an equilibrium saturation binding plot exhibiting the binding of 3 H- 
EZE-glucuronide to rhesus brush border membrane vesicles. 

[0018] Figure IB shows a scatchard analysis of 3 H-EZE-glucuronide binding to rhesus brush 
border membrane vesicles. 
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[0019] Figure 2A shows an equilibrium saturation binding plot exhibiting the binding of 3 H- 
EZE-glucuronide to rat brush border membrane vesicles. 

[0020] Figure 2B shows a scatchard analysis of 3 H-EZE-glucuronide binding to rat brush border 
membrane vesicles. 

[0021] Figure 3A shows association kinetic analysis of 3 H-EZE-glucuronide in rat brush border 
membrane vesicles. 

[0022] Figure 3B shows dissociation kinetic analysis of 3 H-EZE-glucuronide in rat brush border 
membrane vesicles. 

[0023] Figure 4A shows association kinetic analysis of 3 H-EZE-glucuronide in rhesus brush 
border membrane vesicles. 

[0024] Figure 4B shows dissociation kinetic analysis of 3 H-EZE-glucuronide in rhesus brush 
border membrane vesicles. 

[0025] Figure 5 shows the results of a binding assay where 3 H-EZE-glucuronide is dissociated 
by EZE-glucuronide and compound 2 from rhesus (A) and rat (B) brush border membrane 
vesicles. 

[0026] Figure 6 shows the results of a binding assay where 35 S-2 is dissociated by EZE- 
glucuronide and 2 from mouse brush border membrane vesicles. 

[0027] Figure 7 shows the distribution of 3 H-EZE-glucuronide binding to various portions of 
rhesus (A) and rat (B) intestinal tissue. 
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[0028] Figure 8 shows the results of a binding assay where S-2 is dissociated by EZE- 
glucuronide and various analogs from CHO cells transfected with rat NPC1L1. 

[0029] Figure 9 shows the results of a binding assay where 35 S-2 is dissociated by EZE- 
glucuronide and various analogs from CHO cells transfected with human NPC1L1. 

[0030] Figure 10 shows the binding of 35 S-2 to brush border membrane vesicles prepared from 
wild type (A) and NPC1L1 knockout (-/-) mice. 

[0031] Figure 1 1 shows the results of a binding assay where 35 S-2 is dissociated by compound 2 
from mouse wild type (A) and NPC1L1 knockout (-/-) (B) brush border membrane vesicles. 



DETAILED DESCRIPTION OF THE INVENTION 
[0032] The present invention includes an NPC1L1 polypeptide from rat, human and from 
mouse along with polynucleotides encoding the respective polypeptides. Preferably, the rat 
NPC1L1 polypeptide comprises the amino acid sequence set forth in SEQ ID NO: 2, the human 
NPC1L1 comprises the amino acid sequence set forth in SEQ ID NO: 4 and the mouse NPC1L1 
polypeptide comprises the amino acid sequence set forth in SEQ ED NO: 12. The rat NPC1L1 
polynucleotide of SEQ ID NO: 1 or 10 encodes the rat NPC1L1 polypeptide. The human 
NPC1L1 polynucleotide of SEQ ED NO: 3 encodes the human NPC1L1 polypeptide. The mouse 
NPC1L1 polynucleotide of SEQ ED NO: 1 1 or 13 encodes the mouse NPC1L1 polypeptide. 

[0033] The present invention includes any isolated polynucleotide or isolated polypeptide 
comprising a nucleotide or amino acid sequence referred to, below, in Table 1 . 
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Table 1. Polynucleotides and Polypeptides of the Invention. 



Polynucleotide or Polypeptide 


Sequence Identifier 


Rat NPC1L1 polynucleotide 


SEQ ID NO: 1 


Rat NPC1L1 polypeptide 


SEQ ID NO: 2 


Human NPC1L1 polynucleotide 


SEQ ID NO: 3 


Human NPC1L1 polypeptide 


SEQ ID NO: 4 


Rat NPC1L1 expressed sequence tag 
603662080F1 (partial sequence) 


SEQ ID NO: 5 


Rat NPC1L1 expressed sequence tag 
603665037F1 (partial sequence) 


SEQ ID NO: 6 


Rat NPC1L1 expressed sequence tag 
6040345 87F1 (partial sequence) 


SEQ ID NO: 7 


EST 603662080F1 with downstream 
sequences added 


SEQ ID NO: 8 


EST 603662080F1 with upstream and 
downstream sequences added 


SEQ ID NO: 9 


Back-translated polynucleotide sequence of 
ratNPClLl 


SEQ ID NO: 10 


Mouse NPC1L1 polynucleotide 


SEQ ID NO: 1 1 


Mouse NPC1L1 polypeptide 


SEQ ID NO: 12 


Back- translated polynucleotide sequence of 
mouse NPC1L1 


SEQ ID NO: 13 


Back-translated polynucleotide sequence of 
human NPC1 LI 


SEQ ID NO: 51 



[0034] A human NPC1L1 is also disclosed under Genbank Accession Number AF192522. 
As discussed below, the nucleotide sequence of the rat NPC1L1 set forth in SEQ ID NO: 1 was 
obtained from an expressed sequence tag (EST) from a rat jejunum enterocyte cDNA library. 
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SEQ ID NOs: 5-7 include partial nucleotide sequences of three independent cDNA clones. The 
downstream sequence of the SEQ ID NO: 5 EST (603662080F1) were determined; the 
sequencing data from these experiments are set forth in SEQ ID NO: 8. The upstream sequences 
were also determined; these data are set forth in SEQ ID NO: 9. 

[0035] SEQ ID NOs: 43 and 44 are the nucleotide and amino acid sequence, respectively, of 
human NPC1L1 which is disclosed under Genbank Accession No.: AF192522 (see Davies, et 
ah, (2000) Genomics 65(2): 137-45). 

[0036] SEQ ID NO: 45 is the nucleotide sequence of a mouse NPC1L1 which is disclosed 
under Genbank Accession No. AK078947. 

[0037] NPC1L1 mediates intestinal sterol {e.g., cholesterol) or 5a-stanol absorption. 
Inhibition of NPC1L1 in a patient is a useful method for reducing intestinal sterol {e.g., 
cholesterol) or 5ot-stanol absorption and serum sterol {e.g., cholesterol) or 5a-stanol in the 
patient. Reducing the level of intestinal sterol {e.g., cholesterol) or 5oc-stanol absorption and 
serum sterol {e.g., cholesterol) or 5a-stanol in a patient is a useful way in which to treat or 
prevent the occurrence of atherosclerosis, particularly diet-induced atherosclerosis. 

[0038] As used herein, the term "sterol" includes, but is not limited to, cholesterol and 
phytosterols (including, but not limited to, sitosterol, campesterol, stigmasterol and avenosterol). 

[0039] As used herein, the term "5a-stanol" includes, but is not limited to, cholestanol, 5a- 
campestanol and 5ot-sitostanol. 
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Molecular Biology 

[0040] In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. 
Such techniques are explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, 
Molecular Cloning: A Laboratory Manual , Second Edition (1989) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (herein "Sambrook, et al, 1989"); DNA 
Cloning: A Practical Approach , Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide 
Synthesis (M.J. Gait, ed. 1984); Nucleic Acid Hybridization (B.D. Hames & S J. Higgins, eds. 
(1985)); Transcription And Translation (B.D. Hames & S.J. Higgins, eds. (1984)); Animal Cell 
Culture (R.I. Freshney, ed. (1986)); Immobilized Cells And Enzymes (ERL Press, (1986)); B. 
Perbal, A Practical Guide To Molecular Cloning (1984); F.M. Ausubel, et al. (eds.), Current 
Protocols in Molecular Biology , John Wiley & Sons, Inc. (1994). 

[0041] The back-translated sequences of SEQ ID NO: 10 and of SEQ ID NO: 13 uses the 
single-letter code shown in Table 1 of Annex C, Appendix 2 of the PCT Administrative 
Instruction in the Manual of Patent Examination Procedure. 

[0042] A "polynucleotide", "nucleic acid " or "nucleic acid molecule" may refer to the 
phosphate ester polymeric form of ribonucleosides (adenosine, guanosine, uridine or cytidine; 
"RNA molecules") or deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, 
or deoxycytidine; "DNA molecules"), or any phosphoester analogs thereof, such as 
phosphorothioates and thioesters, in single stranded form, double-stranded form or otherwise. 
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/ 

[0043] A "polynucleotide sequence", "nucleic acid sequence" or "nucleotide sequence" is a 
series of nucleotide bases (also called "nucleotides") in a nucleic acid, such as DNA or RNA, 
and means any chain of two or more nucleotides. 

[0044] A "coding sequence" or a sequence "encoding" an expression product, such as a 
RNA, polypeptide, protein, or enzyme, is a nucleotide sequence that, when expressed, results in 
production of the product. 

[0045] The term "gene" means a DNA sequence that codes for or corresponds to a particular 
sequence of ribonucleotides or amino acids which comprise all or part of one or more RNA 
molecules, proteins or enzymes, and may or may not include regulatory DNA sequences, such as 
promoter sequences, which determine, for example, the conditions under which the gene is 
expressed. Genes may be transcribed from DNA to RNA which may or may not be translated 
into an amino acid sequence. 

[0046] The present invention includes nucleic acid fragments of any of SEQ ID NOs: 1,5-11 
or 13. A nucleic acid "fragment" includes at least about 30 (e.g., 31, 32, 33, 34), preferably at 
least about 35 (e.g, 25, 26, 27, 28, 29, 30, 31, 32, 33 or 34), more preferably at least about 45 
(e.g., 35, 36, 37, 38, 39, 40, 41, 42, 43 or 44), and most preferably at least about 126 or more 
contiguous nucleotides (e.g., 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 150, 160, 
170, 180, 190, 200, 300, 400, 500, 1000 or 1200) from any of SEQ ID NOs: 1, 5-1 1 or 13. 

[0047] The present invention also includes nucleic acid fragments consisting of at least about 
7 (e.g., 9, 12, 17, 19), preferably at least about 20 (e.g., 30, 40, 50, 60), more preferably about 70 
(e.g., 80, 90, 95), yet more preferably at least about 100 (e.g., 105, 1 10, 1 14) and even more 
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preferably at least about 115 (e.g., 117, 119, 120, 122, 124, 125, 126) contiguous nucleotides 
from anyofSEQIDNOs: 1,5-11 or 13. 

[0048] As used herein, the term "oligonucleotide" refers to a nucleic acid, generally of no 
more than about 100 nucleotides (e.g., 30, 40, 50, 60, 70, 80, or 90), that may be hybridizable to 
a genomic DNA molecule, a cDNA molecule, or an mRNA molecule encoding a gene, mRNA, 
cDNA, or other nucleic acid of interest. Oligonucleotides can be labeled, e.g., by incorporation 
of 32 P-nucleotides, 3 H-nucleotides, 14 C-nucleotides, 35 S-nucleotides or nucleotides to which a 
label, such as biotin, has been covalently conjugated. In one embodiment, a labeled 
oligonucleotide can be used as a probe to detect the presence of a nucleic acid. In another 
embodiment, oligonucleotides (one or both of which may be labeled) can be used as PCR 
primers, either for cloning full length or a fragment of the gene, or to detect the presence of 
nucleic acids. Generally, oligonucleotides are prepared synthetically, preferably on a nucleic 
acid synthesizer. 

[0049] A "protein sequence", "peptide sequence" or "polypeptide sequence" or "amino acid 
sequence" may refer to a series of two or more amino acids in a protein, peptide or polypeptide. 

[0050] "Protein", "peptide" or "polypeptide" includes a contiguous string of two or more 
amino acids. Preferred peptides of the invention include those set forth in any of SEQ ID NOs: 2 
or 12 as well as variants and fragments thereof. Such fragments preferably comprise at least 
about 10 (e.g., 1 1, 12, 13, 14, 15, 16, 17, 18 or 19), more preferably at least about 20 (e.g., 21, 
22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40), and yet more preferably at least about 42 (e.g., 43, 44, 
45, 46, 47, 48, 49, 50, 60, 70, 80, 90, 100, 1 10, 120 or 130) or more contiguous amino acid 
residues from any of SEQ ED NOs: 2 or 12. 
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[0051] The present invention also includes polypeptides, preferably antigenic polypeptides, 
consisting of at least about 7 (e.g., 9, 10, 13, 15, 17, 19), preferably at least about 20 (e.g., 22, 24, 
26, 28), yet more preferably at least about 30 (e.g., 32, 34, 36, 38) and even more preferably at 
least about 40 (e.g., 41, 42) contiguous amino acids from any of SEQ ED NOs: 2 or 12. 

[0052] The polypeptides of the invention can be produced by proteolytic cleavage of an 
intact peptide, by chemical synthesis or by the application of recombinant DNA technology and 
are not limited to polypeptides delineated by proteolytic cleavage sites. The polypeptides, either 
alone or cross-linked or conjugated to a carrier molecule to render them more immunogenic, are 
useful as antigens to elicit the production of antibodies and fragments thereof. The antibodies 
can be used, e.g., in immunoassays for immunoaffmity purification or for inhibition of NPC1L1, 
etc. 

[0053] The terms "isolated polynucleotide" or "isolated polypeptide" include a 
polynucleotide (e.g., RNA or DNA molecule, or a mixed polymer) or a polypeptide, 
respectively, which are partially or fully separated from other components that are normally 
found in cells or in recombinant DNA expression systems. These components include, but are 
not limited to, cell membranes, cell walls, ribosomes, polymerases, serum components and 
extraneous genomic sequences. 

[0054] An isolated polynucleotide or polypeptide will, preferably, be an essentially 
homogeneous composition of molecules but may contain some heterogeneity. 
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[0055] "Amplification" of DNA as used herein may denote the use of polymerase chain 
reaction (PCR) to increase the concentration of a particular DNA sequence within a mixture of 
DNA sequences. For a description of PCR see Saiki, et al., Science (1988) 239: 487. 

[0056] The term "host cell" includes any cell of any organism that is selected, modified, 
transfected, transformed, grown, or used or manipulated in any way, for the production of a 
substance by the cell, for example, the expression or replication, by the cell, of a gene, a DNA or 
RNA sequence or a protein. Preferred host cells include Chinese hamster ovary (CHO) cells, 
murine macrophage J774 cells or any other macrophage cell line and human intestinal epithelial 
Caco2 cells. 

[0057] The nucleotide sequence of a nucleic acid may be determined by any method known 
in the art (e.g., chemical sequencing or enzymatic sequencing). "Chemical sequencing" of DNA 
includes methods such as that of Maxam and Gilbert (1977) (Proc. Natl. Acad. Sci. USA 74: 
560), in which DNA is randomly cleaved using individual base-specific reactions. "Enzymatic 
sequencing" of DNA includes methods such as that of Sanger (Sanger, et al., (1977) Proc. Natl. 
Acad. Sci. USA 74: 5463). 

[0058] The nucleic acids herein may be flanked by natural regulatory (expression control) 
sequences, or may be associated with heterologous sequences, including promoters, internal 
ribosome entry sites (IRES) and other ribosome binding site sequences, enhancers, response 
elements, suppressors, signal sequences, polyadenylation sequences, introns, 5'- and 3'- non- 
coding regions, and the like. 
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[0059] In general, a "promoter" or "promoter sequence" is a DNA regulatory region capable 
of binding an RNA polymerase in a cell (e.g., directly or through other promoter-bound proteins 
or substances) and initiating transcription of a coding sequence. A promoter sequence is, in 
general, bounded at its 3' terminus by the transcription initiation site and extends upstream (5' 
direction) to include the minimum number of bases or elements necessary to initiate transcription 
at any level. Within the promoter sequence may be found a transcription initiation site 
(conveniently defined, for example, by mapping with nuclease SI), as well as protein binding 
domains (consensus sequences) responsible for the binding of RNA polymerase. The promoter 
may be operably associated with other expression control sequences, including enhancer and 
repressor sequences or with a nucleic acid of the invention. Promoters which may be used to 
control gene expression include, but are not limited to, cytomegalovirus (CMV) promoter (U.S. 
Patent Nos. 5,385,839 and 5,168,062), the SV40 early promoter region (Benoist, et al, (1981) 
Nature 290: 304-310), the promoter contained in the 3' long terminal repeat of Rous sarcoma 
virus (Yamamoto, et al, (1980) Cell 22: 787 '-797), the herpes thymidine kinase promoter 
(Wagner, et al, (1981) Proc. Natl. Acad. Sci. USA 78: 1441-1445), the regulatory sequences of 
the metallothionein gene (Brinster, et al, (1982) Nature 296: 39-42); prokaryotic expression 
vectors such as the pMactamase promoter (Villa-Komaroff, et al, (1978) Proc. Natl. Acad. Sci. 
USA 75: 3727-3731), or the tac promoter (DeBoer, et al, (1983) Proc. Natl. Acad. Sci. USA 
80: 21-25); see also "Useful proteins from recombinant bacteria" in Scientific American (1980) 
242: 74-94; and promoter elements from yeast or other fungi such as the Gal 4 promoter, the 
ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) promoter or the alkaline 
phosphatase promoter. 
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[0060] A coding sequence is "under the control of, "functionally associated with" or 
"operably associated with" transcriptional and translational control sequences in a cell when the 
sequences direct RNA polymerase mediated transcription of the coding sequence into RNA, 
preferably mRNA, which then may be RNA spliced (if it contains introns) and, optionally, 
translated into a protein encoded by the coding sequence. 

[0061] The terms "express" and "expression" mean allowing or causing the information in a 
gene, RNA or DNA sequence to become manifest; for example, producing a protein by 
activating the cellular functions involved in transcription and translation of a corresponding 
gene. A DNA sequence is expressed in or by a cell to form an "expression product" such as an 
RNA (e.g., mRNA) or a protein. The expression product itself may also be said to be 
"expressed" by the cell. 

[0062] The term "transformation" means the introduction of a nucleic acid into a cell. The 
introduced gene or sequence may be called a "clone". A host cell that receives the introduced 
DNA or RNA has been "transformed" and is a "transformant" or a "clone." The DNA or RNA 
introduced to a host cell can come from any source, including cells of the same genus or species 
as the host cell, or from cells of a different genus or species. 

[0063] The term "vector" includes a vehicle (e.g., a plasmid) by which a DNA or RNA 
sequence can be introduced into a host cell, so as to transform the host and, optionally, promote 
expression and/or replication of the introduced sequence. 

[0064] Vectors that can be used in this invention include plasmids, viruses, bacteriophage, 
integratable DNA fragments, and other vehicles that may facilitate introduction of the nucleic 



NY02:471 952.1 



-20- 



P36134 074669.0105 



acids into the genome of the host. Plasmids are the most commonly used form of vector but all 
other forms of vectors which serve a similar function and which are, or become, known in the art 
are suitable for use herein. See, e.g., Pouwels, et al., Cloning Vectors: A Laboratory Manual , 
1985 and Supplements, Elsevier, N.Y., and Rodriguez et al. (eds.), Vectors: A Survey of 
Molecular Cloning Vectors and Their Uses , 1988, Buttersworth, Boston, MA. 

[0065] The term "expression system" means a host cell and compatible vector which, under 
suitable conditions, can express a protein or nucleic acid which is carried by the vector and 
introduced to the host cell. Common expression systems include E. coli host cells and plasmid 
vectors, insect host cells and Baculovirus vectors, and mammalian host cells and vectors. 

[0066] Expression of nucleic acids encoding the NPC1L1 polypeptides of this invention can 
be carried out by conventional methods in either prokaryotic or eukaryotic cells. Although E. 
coli host cells are employed most frequently in prokaryotic systems, many other bacteria, such as 
various strains of Pseudomonas and Bacillus, are known in the art and can be used as well. 
Suitable host cells for expressing nucleic acids encoding the NPC1L1 polypeptides include 
prokaryotes and higher eukaryotes. Prokaryotes include both gram-negative and gram-positive 
organisms, e.g., E. coli and B. subtilis. Higher eukaryotes include established tissue culture cell 
lines from animal cells, both of non-mammalian origin, e.g., insect cells, and birds, and of 
mammalian origin, e.g. , human, primates, and rodents. 

[0067] Prokaryotic host-vector systems include a wide variety of vectors for many different 
species. A representative vector for amplifying DNA is pBR322 or many of its derivatives {e.g., 
pUC18 or 19). Vectors that can be used to express the NPC1L1 polypeptides include, but are not 
limited to, those containing the lac promoter (pUC-series); trp promoter (pBR322-/rp); Ipp 
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promoter (the pIN-series); lambda-pP or pR promoters (pOTS); or hybrid promoters such as ptac 
(pDR540). See Brosius et al, "Expression Vectors Employing Lambda-, trp-, lac-, and Ipp- 
derived Promoters", in Rodriguez and Denhardt (eds.) Vectors: A Survey of Molecular Cloning 
Vectors and Their Uses , 1988, Buttersworth, Boston, pp. 205-236. Many polypeptides can be 
expressed, at high levels, in an E.colilTl expression system as disclosed in U.S. Patent Nos. 
4,952,496; 5,693,489 and 5,869,320 and in Davanloo, P., et al, (1984) Proc. Natl. Acad. Sci. 
USA 81: 2035-2039; Studier, F.W., et al, (1986) J. Mol. Biol. 189: 113-130; Rosenberg, A. H., 
et al, (1987) Gene 56: 125-135; and Dunn, J.J., et al, (1988) Gene 68: 259. 

[0068] Higher eukaryotic tissue culture cells may also be used for the recombinant 
production of the NPC1L1 polypeptides of the invention. Although any higher eukaryotic tissue 
culture cell line might be used, including insect baculovirus expression systems, mammalian 
cells are preferred. Transformation or transfection and propagation of such cells have become a 
routine procedure. Examples of useful cell lines include HeLa cells, Chinese hamster ovary 
(CHO) cell lines, J774 cells, Caco2 cells, baby rat kidney (BRK) cell lines, insect cell lines, bird 
cell lines, and monkey (COS) cell lines. Expression vectors for such cell lines usually include an 
origin of replication, a promoter, a translation initiation site, RNA splice sites (if genomic DNA 
is used), a polyadenylation site, and a transcription termination site. These vectors also, usually, 
contain a selection gene or amplification gene. Suitable expression vectors may be plasmids, 
viruses, or retroviruses carrying promoters derived, e.g., from such sources as adenovirus, SV40, 
parvoviruses, vaccinia virus, or cytomegalovirus. Examples of expression vectors include 
pCR®3.1, pCDNAl, pCD (Okayama, et al, (1985) Mol. Cell Biol. 5: 1 136), pMClneo Poly-A 
(Thomas, etal, (1987) Cell 51: 503), pREP8, pSVSPORT and derivatives thereof, and 
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baculovirus vectors such as pAC373 or pAC610. One embodiment of the invention includes 
membrane bound NPC1L1. In this embodiment, NPC1L1 can be expressed in the cell 
membrane of a eukaryotic cell and the membrane bound protein can be isolated from the cell by 
conventional methods which are known in the art. 

[0069] The present invention also includes fusions which include the NPC1L1 polypeptides 
and NPC1L1 polynucleotides of the present invention and a second polypeptide or 
polynucleotide moiety, which may be referred to as a "tag". The fusions of the present invention 
may comprise any of the polynucleotides or polypeptides set forth in Table 1 or any subsequence 
or fragment thereof (discussed above). The fused polypeptides of the invention may be 
conveniently constructed, for example, by insertion of a polynucleotide of the invention or 
fragment thereof into an expression vector. The fusions of the invention may include tags which 
facilitate purification or detection. Such tags include glutathione-S-transferase (GST), 
hexahistidine (His6) tags, maltose binding protein (MBP) tags, haemagglutinin (HA) tags, 
cellulose binding protein (CBP) tags and myc tags. Detectable tags such as 32 P, 35 S, 3 H, 99m Tc, 
123 I, ,n In, 68 Ga, 18 F, 125 I, ,3, I 5 U3m In, 76 Br, 67 Ga, 99m Tc, 123 1, 11 'in and 68 Ga may also be used to 
label the polypeptides and polynucleotides of the invention. Methods for constructing and using 
such fusions are very conventional and well known in the art. 

[0070] Modifications {e.g., post-translational modifications) that occur in a polypeptide often 
will be a function of how it is made. For polypeptides made by expressing a cloned gene in a 
host, for instance, the nature and extent of the modifications, in large part, will be determined by 
the host cell's post-translational modification capacity and the modification signals present in the 
polypeptide amino acid sequence. For instance, as is well known, glycosylation often does not 
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occur in bacterial hosts such as E. coli. Accordingly, when glycosylation is desired, a 
polypeptide can be expressed in a glycosylating host, generally a eukaryotic cell. Insect cells 
often carry out post-translational glycosylations which are similar to those of mammalian cells. 
For this reason, insect cell expression systems have been developed to express, efficiently, 
mammalian proteins having native patterns of glycosylation. An insect cell which may be used 
in this invention is any cell derived from an organism of the class Insecta. Preferably, the insect 
is Spodoptera fruigiperda (Sf9 or Sf21) or Trichoplusia ni (High 5). Examples of insect 
expression systems that can be used with the present invention, for example to produce NPC1L1 
polypeptide, include Bac-To-Bac (Invitrogen Corporation, Carlsbad, CA) or Gateway 
(Invitrogen Corporation, Carlsbad, CA). If desired, deglycosylation enzymes can be used to 
remove carbohydrates attached during production in eukaryotic expression systems. 

[0071] Other modifications may also include addition of aliphatic esters or amides to the 
polypeptide carboxyl terminus. The present invention also includes analogs of the NPC1L1 
polypeptides which contain modifications, such as incorporation of unnatural amino acid 
residues, or phosphorylated amino acid residues such as phosphotyrosine, phosphoserine or 
phosphothreonine residues. Other potential modifications include sulfonation, biotinylation, or 
the addition of other moieties. For example, the NPC1L1 polypeptides of the invention may be 
appended with a polymer which increases the half-life of the peptide in the body of a subject. 
Preferred polymers include polyethylene glycol (PEG) {e.g., PEG with a molecular weight of 2 
kDa, 5 kDa, 10 kDa, 12 kDa, 20 kDa, 30 kDa and 40 kDa), dextran and 
monomethoxypolyethylene glycol (mPEG). 
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[0072] The peptides of the invention may also be cyclized. Specifically, the amino- and 
carboxy-terminal residues of an NPC1L1 polypeptide or two internal residues of an NPC1L1 
polypeptide of the invention can be fused to create a cyclized peptide. Methods for cyclizing 
peptides are conventional and very well known in the art; for example, see Gurrath, et al, (1992) 
Eur. J. Biochem. 210:911-921. 

[0073] The present invention contemplates any superficial or slight modification to the 
amino acid or nucleotide sequences which correspond to the polypeptides of the invention. In 
particular, the present invention contemplates sequence conservative variants of the nucleic acids 
which encode the polypeptides of the invention. "Sequence-conservative variants" of a 
polynucleotide sequence are those in which a change of one or more nucleotides in a given 
codon results in no alteration in the amino acid encoded at that position. Function-conservative 
variants of the polypeptides of the invention are also contemplated by the present invention. 
"Function-conservative variants" are those in which one or more amino acid residues in a protein 
or enzyme have been changed without altering the overall conformation and function of the 
polypeptide, including, but, by no means, limited to, replacement of an amino acid with one 
having similar properties. Amino acids with similar properties are well known in the art. For 
example, polar/hydrophilic amino acids which may be interchangeable include asparagine, 
glutamine, serine, cysteine, threonine, lysine, arginine, histidine, aspartic acid and glutamic acid; 
nonpolar/hydrophobic amino acids which may be interchangeable include glycine, alanine, 
valine, leucine, isoleucine, proline, tyrosine, phenylalanine, tryptophan and methionine; acidic 
amino acids, which may be interchangeable include aspartic acid and glutamic acid and basic 
amino acids, which may be interchangeable include histidine, lysine and arginine. 
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[0074] The present invention includes polynucleotides encoding rat, human or mouse 
NPC1L1 and fragments thereof as well as nucleic acids which hybridize to the polynucleotides. 
Preferably, the nucleic acids hybridize under low stringency conditions, more preferably under 
moderate stringency conditions and most preferably under high stringency conditions. A nucleic 
acid molecule is "hybridizable" to another nucleic acid molecule, such as a cDNA, genomic 
DNA, or RNA, when a single stranded form of the nucleic acid molecule can anneal to the other 
nucleic acid molecule under the appropriate conditions of temperature and solution ionic strength 
(see Sambrook, et al, supra). The conditions of temperature and ionic strength determine the 
"stringency" of the hybridization. Typical low stringency hybridization conditions are 55°C, 5X 
SSC, 0.1% SDS, 0.25% milk, and no formamide at 42°C; or 30% formamide, 5X SSC, 0.5% 
SDS at 42°C. Typical, moderate stringency hybridization conditions are similar to the low 
stringency conditions except the hybridization is carried out in 40% formamide, with 5X or 6X 
SSC at 42°C. High stringency hybridization conditions are similar to low stringency conditions 
except the hybridization conditions are carried out in 50% formamide, 5X or 6X SSC and, 
optionally, at a higher temperature {e.g., higher than 42°C: 57°C, 59°C, 60°C, 62°C, 63°C, 65°C 
or 68°C). In general, SSC is 0.1 5M NaCl and 0.0 15M Na-citrate. Hybridization requires that 
the two nucleic acids contain complementary sequences, although, depending on the stringency 
of the hybridization, mismatches between bases are possible. The appropriate stringency for 
hybridizing nucleic acids depends on the length of the nucleic acids and the degree of 
complementation, variables well known in the art. The greater the degree of similarity or 
homology between two nucleotide sequences, the higher the stringency under which the nucleic 
acids may hybridize. For hybrids of greater than 100 nucleotides in length, equations for 
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calculating the melting temperature have been derived (see Sambrook, et al, supra, 9.50-9.51). 
For hybridization with shorter nucleic acids, i.e., oligonucleotides, the position of mismatches 
becomes more important, and the length of the oligonucleotide determines its specificity (see 
Sambrook, et al., supra). 

[0075] Also included in the present invention are polynucleotides comprising nucleotide 
sequences and polypeptides comprising amino acid sequences which are at least about 70% 
identical, preferably at least about 80% identical, more preferably at least about 90% identical 
and most preferably at least about 95% identical {e.g., 95%, 96%, 97%, 98%, 99%, 100%) to the 
reference rat NPC1L1 nucleotide (e.g., any of SEQ ID NOs: 1 or 5-10) and amino acid sequences 
(e.g., SEQ ID NO: 2), reference human NPC1L1 nucleotide (e.g., SEQ ID NO: 3) and amino 
acid sequences (e.g., SEQ ID NO: 4) or the reference mouse NPC1L1 nucleotide (e.g., any of 
SEQ ID NOs: 11 or 13) and amino acid sequences (e.g., SEQ ID NO: 12), when the comparison 
is performed by a BLAST algorithm wherein the parameters of the algorithm are selected to give 
the largest match between the respective sequences over the entire length of the respective 
reference sequences. Polypeptides comprising amino acid sequences which are at least about 
70% similar, preferably at least about 80% similar, more preferably at least about 90% similar 
and most preferably at least about 95% similar (e.g., 95%, 96%, 97%, 98%, 99%, 100%) to the 
reference rat NPC1L1 amino acid sequence of SEQ ID NO: 2, reference human NPC1L1 amino 
acid sequence of SEQ ID NO: 4 or the reference mouse NPC1L1 amino acid sequence of SEQ 
ID NO: 12, when the comparison is performed with a BLAST algorithm wherein the parameters 
of the algorithm are selected to give the largest match between the respective sequences over the 
entire length of the respective reference sequences, are also included in the present invention. 
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[0076] Sequence identity refers to exact matches between the nucleotides or amino acids of 
two sequences which are being compared. Sequence similarity refers to both exact matches 
between the amino acids of two polypeptides which are being compared in addition to matches 
between nonidentical, biochemically related amino acids. Biochemically related amino acids 
which share similar properties and may be interchangeable are discussed above. 

[0077] The following references regarding the BLAST algorithm are herein incorporated by 
reference: BLAST ALGORITHMS: Altschul, S.F., et al, (1990) J. Mol. Biol. 215: 403-410; 
Gish, W., et al, (1993) Nature Genet. 3: 266-272; Madden, T.L., et al, (1996) Meth. Enzymol. 
266: 131-141; Altschul, S.F., et al, (1997) Nucleic Acids Res. 25: 3389-3402; Zhang, J., et al, 
(1997) Genome Res. 7: 649-656; Wootton, J.C., etal, (1993) Comput. Chem. 17: 149-163; 
Hancock, J.M., et al, (1994) Comput. Appl. Biosci. 10: 67-70; ALIGNMENT SCORING 
SYSTEMS: Dayhoff, M.O., et al, "A model of evolutionary change in proteins" in Atlas of 
Protein Sequence and Structure . (1978) vol. 5, suppl. 3. M.O. Dayhoff (ed.), pp. 345-352, Natl. 
Biomed. Res. Found., Washington, DC; Schwartz, R.M., et al, "Matrices for detecting distant 
relationships" in Atlas of Protein Sequence and Structure , (1978) vol. 5, suppl. 3. M.O. Dayhoff 
(ed.), pp. 353-358, Natl. Biomed. Res. Found., Washington, DC; Altschul, S.F., (1991) J. Mol. 
Biol. 219: 555-565; States, D.J., et al, (1991) Methods 3: 66-70; Henikoff, S., et al, (1992) 
Proc. Natl. Acad. Sci. USA 89: 10915-10919; Altschul, S.F., et al, (1993) J. Mol. Evol. 36: 290- 
300; ALIGNMENT STATISTICS: Karlin, S., et al, (1990) Proc. Natl. Acad. Sci. USA 87: 
2264-2268; Karlin, S., et al, (1993) Proc. Natl. Acad. Sci. USA 90: 5873-5877; Dembo, A., et 
al, (1994) Ann. Prob. 22: 2022-2039; and Altschul, S.F. "Evaluating the statistical significance 
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of multiple distinct local alignments" in Theoretical and Computational Methods in Genome 
Research (S. Suhai, ed.), (1997) pp. 1-14, Plenum, New York. 

Protein Purification 

[0078] The proteins, polypeptides and antigenic fragments of this invention can be purified 
by standard methods, including, but not limited to, salt or alcohol precipitation, affinity 
chromatography (e.g., used in conjunction with a purification tagged NPC1L1 polypeptide as 
discussed above), preparative disc-gel electrophoresis, isoelectric focusing, high pressure liquid 
chromatography (HPLC), reversed-phase HPLC, gel filtration, cation and anion exchange and 
partition chromatography, and countercurrent distribution. Such purification methods are well 
known in the art and are disclosed, e.g., in "Guide to Protein Purification", Methods in 
Enzvmology , Vol. 182, M. Deutscher, Ed., 1990, Academic Press, New York, NY. 

[0079] Purification steps can be followed by performance of assays for receptor binding 
activity as described below. Particularly where an NPC1L1 polypeptide is being isolated from a 
cellular or tissue source, it is preferable to include one or more inhibitors of proteolytic enzymes 
in the assay system, such as phenylmethanesulfonyl fluoride (PMSF), Pefabloc SC, pepstatin, 
leupeptin, chymostatin and EDTA. 

Antibody Molecules 

[0080] Antigenic (including immunogenic) fragments of the NPC1L1 polypeptides of the 
invention are within the scope of the present invention (e.g., 42 or more contiguous amino acids 
from SEQ ID NO: 2, 4 or 12). The antigenic peptides may be useful, inter alia, for preparing 
isolated antibody molecules which recognize NPC1L1. Isolated anti-NPClLl antibody 
molecules are useful NPC1L1 ligands. 
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[0081] An antigen is any molecule that can bind specifically to an antibody. Some antigens 
cannot, by themselves, elicit antibody production. Those that can induce antibody production are 
immunogens. 

[0082] Preferably, isolated anti-NPClLl antibodies recognize an antigenic peptide 
comprising an amino acid sequence selected from SEQ ID NOs: 39-42 (e.g., an antigen derived 
from rat NPC1L1). More preferably, the antibody is A0715, A0716, A0717, A0718, A0867, 
A0868, A1801 orA1802. 

[0083] The term "antibody molecule " includes, but is not limited to, antibodies and 
fragments (preferably antigen-binding fragments) thereof. The term includes monoclonal 
antibodies, polyclonal antibodies, bispecific antibodies, Fab antibody fragments, F(ab)2 antibody 
fragments, Fv antibody fragments (e.g., V H or V L ), single chain Fv antibody fragments and dsFv 
antibody fragments. Furthermore, the antibody molecules of the invention may be fully human 
antibodies, mouse antibodies, rat antibodies, rabbit antibodies, goat antibodies, chicken 
antibodies, humanized antibodies or chimeric antibodies. 

[0084] Although it is not always necessary, when NPC1L1 polypeptides are used as antigens 
to elicit antibody production in an immunologically competent host, smaller antigenic fragments 
are, preferably, first rendered more immunogenic by cross-linking or concatenation, or by 
coupling to an immunogenic carrier molecule (i.e., a macromolecule having the property of 
independently eliciting an immunological response in a host animal, such as diptheria toxin or 
tetanus). Cross-linking or conjugation to a carrier molecule may be required because small 
polypeptide fragments sometimes act as haptens (molecules which are capable of specifically 
binding to an antibody but incapable of eliciting antibody production, i.e., they are not 
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immunogenic). Conjugation of such fragments to an immunogenic carrier molecule renders 
them more immunogenic through what is commonly known as the "carrier effect". 

[0085] Carrier molecules include, e.g., proteins and natural or synthetic polymeric 
compounds such as polypeptides, polysaccharides, lipopolysaccharides, etc. Protein carrier 
molecules are especially preferred, including, but not limited to, keyhole limpet hemocyanin and 
mammalian serum proteins such as human or bovine gammaglobulin, human, bovine or rabbit 
serum albumin, or methylated or other derivatives of such proteins. Other protein carriers will 
be apparent to those skilled in the art. Preferably, the protein carrier will be foreign to the host 
animal in which antibodies against the fragments are to be elicited. 

[0086] Covalent coupling to the carrier molecule can be achieved using methods well known 
in the art, the exact choice of which will be dictated by the nature of the carrier molecule used. 
When the immunogenic carrier molecule is a protein, the fragments of the invention can be 
coupled, e.g., using water-soluble carbodiimides such as dicyclohexylcarbodiimide or 
glutaraldehyde. 

[0087] Coupling agents, such as these, can also be used to cross-link the fragments to 
themselves without the use of a separate carrier molecule. Such cross-linking into aggregates 
can also increase immunogenicity. Immunogenicity can also be increased by the use of known 
adjuvants, alone or in combination with coupling or aggregation. 

[0088] Adjuvants for the vaccination of animals include, but are not limited to, Adjuvant 65 
(containing peanut oil, mannide monooleate and aluminum monostearate); Freund's complete or 
incomplete adjuvant; mineral gels such as aluminum hydroxide, aluminum phosphate and alum; 
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surfactants such as hexadecylamine, octadecylamine, lysolecithin, 
dimethyldioctadecylammonium bromide, N,N-dioctadecyl-N',N'-bis(2-hydroxymethyl) 
propanediamine, methoxyhexadecylglycerol and pluronic polyols; polyanions such as pyran, 
dextran sulfate, poly IC, polyacrylic acid and carbopol; peptides such as muramyl dipeptide, 
dimethyl glycine and tuftsin; and oil emulsions. The polypeptides could also be administered 
following incorporation into liposomes or other microcarriers. 

[0089] Information concerning adjuvants and various aspects of immunoassays are disclosed, 
e.g., in the series by P. Tijssen, Practice and Theory of Enzyme Immunoassays , 3rd Edition, 
1987, Elsevier, New York. Other useful references covering methods for preparing polyclonal 
antisera include Microbiology , 1969, Hoeber Medical Division, Harper and Row; Landsteiner, 
Specificity of Serological Reactions . 1962, Dover Publications, New York, and Williams, et al, 
Methods in Immunology and Immunochemistry , Vol. 1, 1967, Academic Press, New York. 

[0090] The anti-NPClLl antibody molecules of the invention preferably recognize human, 
mouse or rat NPC1L1; however, the present invention includes antibody molecules which 
recognize NPC1L1 from any species, preferably mammals (e.g., cat, sheep or horse). The 
present invention also includes complexes comprising an NPC1L1 polypeptide of the invention 
and an anti-NPClLl antibody molecule. Such complexes can be made by simply contacting the 
antibody molecule with its cognate polypeptide. 

[0091] Various methods may be used to make the antibody molecules of the invention. 
Human antibodies can be made, for example, by methods which are similar to those disclosed in 
U.S. Patent Nos. 5,625,126; 5,877,397; 6,255,458; 6,023,010 and 5,874,299. 
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[0092] Hybridoma cells which produce the monoclonal anti-NPClLl antibodies may be 
produced by methods which are commonly known in the art. These methods include, but are not 
limited to, the hybridoma technique originally developed by Kohler, et al, (1975) (Nature 256: 
495-497), as well as the trioma technique (Hering, et al, (1988) Biomed. Biochim. Acta. 47: 
21 1-216 and Hagiwara, et al, (1993) Hum. Antibod. Hybridomas 4: 15), the human B-cell 
hybridoma technique (Kozbor, et al, (1983) Immunology Today 4: 72 and Cote, et al, (1983) 
Proc. Natl. Acad. Sci. U.S.A 80: 2026-2030), and the EBV-hybridoma technique (Cole, et al, in 
Monoclonal Antibodies and Cancer Therapy . Alan R. Liss, Inc., pp. 77-96, 1985). ELISA may 
be used to determine if hybridoma cells are expressing anti-NPClLl antibodies. 

[0093] The anti-NPClLl antibody molecules of the present invention may also be produced 
recombinantly {e.g., in an E.colilll expression system as discussed above). In this embodiment, 
nucleic acids encoding the antibody molecules of the invention (e.g., V H or V L ) may be inserted 
into a pet-based plasmid and expressed in the E.colilll system. There are several methods by 
which to produce recombinant antibodies which are known in the art. An example of a method 
for recombinant production of antibodies is disclosed in U.S. Patent No. 4,816,567. See also 
Skerra, A., et al, (1988) Science 240: 1038-1041; Better, M., et al, (1988) Science 240: 1041- 
1043 and Bird, R.E., et al, (1988) Science 242: 423-426. 

[0094] The term "monoclonal antibody," includes an antibody obtained from a population of 
substantially homogeneous antibodies, i.e., the individual antibodies comprising the population 
are identical except for possible, naturally occurring mutations that may be present in minor 
amounts. Monoclonal antibodies are highly specific, being directed against a single antigenic 
site. Monoclonal antibodies are advantageous in that they may be synthesized by a hybridoma 
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culture, essentially uncontaminated by other immunoglobulins. The modifier "monoclonal" 
indicates the character of the antibody as being obtained from a substantially homogeneous 
population of antibodies, and is not to be construed as requiring production of the antibody by 
any particular method. The monoclonal antibodies to be used in accordance with the present 
invention may be made by the hybridoma method as described by Kohler, et al, (1975) Nature 
256: 495. 

[0095] The term "polyclonal antibody" includes an antibody which was produced among or 
in the presence of one or more other, non-identical antibodies. In general, polyclonal antibodies 
are produced from a B-lymphocyte in the presence of several other B-lymphocytes which 
produced non-identical antibodies. Typically, polyclonal antibodies are obtained directly from 
an immunized animal (e.g., a rabbit). 

[0096] A "bispecific antibody" comprises two different antigen binding regions which bind 
to distinct antigens. Bispecific antibodies, as well as methods of making and using the 
antibodies, are conventional and very well known in the art. 

[0097] Anti-idiotypic antibodies or anti-idiotypes are antibodies directed against the antigen- 
combining region or variable region (called the idiotype) of another antibody molecule. As 
disclosed by Jeme (Jerne, N. K., (1974) Ann. Immunol. (Paris) 125c: 373 and Jerne, N. K., et al, 
(1982) EMBO 1 : 234), immunization with an antibody molecule expressing a paratope (antigen- 
combining site) for a given antigen (e.g., NPC1L1) will produce a group of anti-antibodies, some 
of which share, with the antigen, a complementary structure to the paratope. Immunization with 
a subpopulation of the anti-idiotypic antibodies will, in turn, produce a subpopulation of 
antibodies or immune cell subsets that are reactive to the initial antigen. 
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[0098] The term "fully human antibody" refers to an antibody which comprises human 
immunoglobulin sequences only. Similarly, "mouse antibody" refers to an antibody which 
comprises mouse immunoglobulin sequences only. 

[0100] "Human/mouse chimeric antibody" refers to an antibody which comprises a mouse 
variable region (Vh and Vl) fused to a human constant region. 

[0101] "Humanized" anti-NPClLl antibodies are also within the scope of the present invention. 
Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, which 
contain minimal sequence derived from non-human immunoglobulin. For the most part, 
humanized antibodies are human immunoglobulins (recipient antibody) in which residues from a 
complementary determining region of the recipient are replaced by residues from a 
complementary determining region of a nonhuman species (donor antibody), such as mouse, rat 
or rabbit, having a desired specificity, affinity and capacity. In some instances, Fv framework 
residues of the human immunoglobulin are also replaced by corresponding non-human residues. 

[0102] "Single-chain Fv" or "sFv" antibody fragments include the V H and/or V L domains of an 
antibody, wherein these domains are present in a single polypeptide chain. Generally, the sFv 
polypeptide further comprises a polypeptide linker between the Vh and V L domains which 
enables the sFv to form the desired structure for antigen binding. Techniques described for the 
production of single chain antibodies (U.S. Patent Nos. 5,476,786; 5,132,405 and 4,946,778) can 
be adapted to produce anti-NPClLl specific, single chain antibodies. For a review of sFv see 
Pluckthun in The Pharmacology of Monoclonal Antibodies , vol. 1 13, Rosenburg and Moore, 
eds., Springer- Verlag, N.Y., pp. 269-315 (1994). 
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[0103] "Disulfide stabilized Fv fragments" and "dsFv" include molecules having a variable 
heavy chain (V H ) and/or a variable light chain (V L ) which are linked by a disulfide bridge. 

[0104] Antibody fragments within the scope of the present invention also include F(ab)2 
fragments which may be produced by enzymatic cleavage of an IgG by, for example, pepsin. 
Fab fragments may be produced by, for example, reduction of F(ab) 2 with dithiothreitol or 
mercaptoethylamine. 

[0105] An FV fragment is a V L or V H region. 

[0106] Depending on the amino acid sequences of the constant domain of their heavy chains, 
immunoglobulins can be assigned to different classes. There are at least five major classes of 
immunoglobulins: IgA, IgD, IgE, IgG and IgM, and several of these may be further divided into 
subclasses (isotypes), e.g., IgG-1, IgG-2, IgG-3 and IgG-4; IgA-1 and IgA-2. 

[0107] The anti-NPClLl antibody molecules of the invention may also be conjugated to a 
chemical moiety. The chemical moiety may be, inter alia, a polymer, a radionuclide or a 
cytotoxic factor. Preferably, the chemical moiety is a polymer which increases the half-life of 
the antibody molecule in the body of a subject. Suitable polymers include, but are by no means 
limited to, polyethylene glycol (PEG) (e.g., PEG with a molecular weight of 2kDa, 5kDa, 
lOkDa, 12kDa, 20kDa, 30kDa or 40kDa), dextran and monomethoxypolyethylene glycol 
(mPEG). Methods for producing PEGylated anti-IL8 antibodies which are described in U.S. 
Patent No. 6,133,426 can be applied to the production of PEGylated anti-NPClLl antibodies of 
the invention. Lee, et al, (1999) (Bioconj. Chem. 10: 973-981) discloses PEG conjugated 
single-chain antibodies. Wen, et al, (2001) (Bioconj. Chem. 12: 545-553) discloses conjugating 
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antibodies with PEG which is attached to a radiometal chelator (diethylenetriaminpentaacetic 
acid (DTP A)). 

[0108] The antibody molecules of the invention may also be conjugated with labels such as 99 Tc, 
90 Y, '"In, 32 P 5 ,4 C, 125 I, 3 H, 131 I, n C, ,5 0, ,3 N, 18 F, 35 S, 5, Cr, 57 To, 226 Ra, 60 Co, 59 Fe, 57 Se, 152 Eu, 
67 CU, 217 Ci, 21 'At, 212 Pb, 47 Sc, 109 Pd, 234 Th, 40 K, 157 Gd, 55 Mn, 52 Tr or 56 Fe. 

[0109] The antibody molecules of the invention may also be conjugated with fluorescent or 
chemilluminescent labels, including fluorophores such as rare earth chelates, fluorescein and its 
derivatives, rhodamine and its derivatives, isothiocyanate, phycoerythrin, phycocyanin, 
allophycocyanin, o-phthaladehyde, fluorescamine, 152 Eu, dansyl, umbelliferone, luciferin, 
luminal label, isoluminal label, an aromatic acridinium ester label, an imidazole label, an 
acridimium salt label, an oxalate ester label, an aequorin label, 2,3-dihydrophthalazinediones, 
biotin/avidin, spin labels and stable free radicals. 

[0110] The antibody molecules may also be conjugated to a cytotoxic factor such as diptheria 
toxin, Pseudomonas aeruginosa exotoxin A chain, ricin A chain, abrin A chain, modeccin A 
chain, alpha-sarcin, Aleurites fordii proteins and compounds {e.g., fatty acids), dianthin proteins, 
Phytoiacca americana proteins PAPI, PAPII, and PAP-S, momordica charantia inhibitor, curcin, 
crotin, saponaria officinalis inhibitor, mitogellin, restrictocin, phenomycin, and enomycin. 

[0111] Any method known in the art for conjugating the antibody molecules of the invention to 
the various moieties may be employed, including those methods described by Hunter, et al., 
(1962) Nature 144: 945; David, et al, (1974) Biochemistry 13: 1014; Pain, etal, (1981) J. 
Immunol. Meth. 40: 219; and Nygren, J., (1982) Histochem. and Cytochem. 30: 407. 
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[0112] Methods for conjugating antibodies are conventional and very well known in the art. 

Screening Assays 

[0113] The invention allows the identification of selective ligands of NPC1L1 (e.g., SEQ ID 
NO: 2, 4 or 12) that may be useful in treatment and management of a variety of medical 
conditions, including elevated serum sterol (e.g., cholesterol) or 5a-stanol. Thus, NPC1L1 of 
this invention can be employed in screening systems to identify ligands. These ligands may be 
agonists or antagonists of NPC1L1. Essentially, these assays provide methods for identifying 
ligands of NPC1L1 by using (1) NPC1L1, (2) an appropriate known NPC1L1 ligand, agonist or 
antagonist, for example, a sterol (such as cholesterol, phytosterols, including, but not limited to, 
sitosterol, campesterol, stigmasterol and avenosterol), a cholesterol oxidation product, a 5a- 
stanol (including, but not limited to, cholestanol, 5<x-campestanol and 5ct-sitostanol), a 
substituted azetidinone (e.g., ezetimibe), BODIPY-ezetimibe (Altmann, et al., (2002) Biochim. 
Biophys. Acta 1580(1): 77-93) or 4", 6"-bis[(2-fluorophenyl)carbamoyl]-beta-D-cellobiosyl 
derivative of 11-ketotigogenin as described in DeNinno, et al, (1997) (J. Med. Chem. 40(16): 
2547-54) or any substituted azetidinone, and (3) a sample to be tested for the presence of a 
candidate NPC1L1 ligand. 

[0114] The term "specific" when used to describe binding of, for example, a ligand of 
NPC1L1 in a screening assay is a term of art which refers to the extent by which the ligand or 
antagonist (e.g., substituted azetidinone, ezetimibe, sterol (such as cholesterol) or 5oc-stanol) 
binds preferentially to NPC1L1 in comparison to other proteins in the assay system. For 
example, detection of the specific binding of a ligand of NPC1L1 binds specifically to NPC1L1 
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is made apparent when a signal generated in the assay to indicate such binding exceeds, to any 
extent, a signal generated in a negative control wherein, for example, NPC1L1 or ligand is 
absent. Furthermore, "specific binding" includes binding of a ligand either directly to NPC1L1 
or indirectly, for example via another moiety, in a complex of which NPC1L1 is a part. The 
moiety to which an NPC1L1 ligand binds can be another protein or a post-translational 
modification of NPC1L1 (e.g., a lipid chain or a carbohydrate chain). 

[0115] Non-limiting examples of suitable substituted azetidinones for use in the screening 
assays include those disclosed in U.S. Patent Nos. RE37.721; 5,631,365; 5,767,1 15; 5,846,966; 
5,688,990; 5,656,624; 5,624,920; 5,698,548; 5,756,470; 5,688,787; 5,306,817; 5,633,246; 
5,627,176; 5,688,785; 5,744,467; 5,846,966; 5,728,827; 6,632,933 and U.S. Patent Application 
Publication No 2003/0105028-each of which is herein incorporated by reference in its entirety. 

[01 16] The present invention provides for a method by which to evaluate whether a sample 
contains an NPC1L1 ligand by determining whether the sample contains a candidate compound 
which competes for binding between the known ligand (e.g., ezetimibe) and NPC1L1. The 
ligand may be an agonist or antagonist. 

[0117] Ezetimibe can be prepared by a variety of methods well know to those skilled in the 
art, for example such as are disclosed in U.S. Patents Nos. 5,631,365, 5,767,1 15, 5,846,966, 
6,207,822, U.S. Patent Application Publication No. 2002/0193607 and PCT Patent Application 
WO 93/02048, each of which is incorporated herein by reference in its entirety. 

[0118] "Sample", "candidate compound" or "candidate substance" refers to a compound or 
composition which is evaluated in a test or assay, for example, for the ability to bind to NPC1L1 

NY02:471952.1 -39- 



P36134 074669.0105 



(e.g., SEQ ID NO: 2, 4 or 12) or a functional fragment thereof. The composition may comprise 
candidate compounds, such as small molecules, peptides, nucleotides, polynucleotides, 
subatomic particles (e.g., a particles, (3 particles) or antibodies. 

[01 19] The present invention provides methods for identifying ligands of a compound that 
binds to NPC1L1 which involve contacting NPC1L1 with a detectably labeled substituted 2- 
azetidinone, preferably substituted 2-azetidinone-glucuronide, and a candidate compound, and 
determining whether the candidate compound binds to NPC1L1. The modulation of the binding 
of the substituted 2-azetidinone to NPC1L1 by the binding of the candidate compound to 
NPC1L1 indicates that the candidate compound is a ligand that binds to NPC1L1 and is an 
inhibitor of sterol and 5a-stanol absorption. 

[0120] The present invention also provides a method for identifying a ligand of NPC1L1 
comprising contacting NPC1L1 with a detectably labeled substituted 2-azetidinone, preferably 
substituted 2-azetidinone-glucuronide, and measuring the binding of NPC1L1 of the detectably 
labeled substituted 2-azetidinone in the presence and absence of a candidate compound, wherein 
decreased binding of the detectably labeled substituted 2-azetidinone to the NPC1L1 in the 
presence of the candidate compound indicates that said candidate compound is a ligand of 
NPC1L1 and is an inhibitor of sterol and 5cc-stanol absorption. 

[0121] Preferably, the substituted 2-azetidinones are detectably labelled with 3 H, 35 S, 125 I, or 
a fluorescently labeled substituted 2-azetidinone, preferably substituted 2-azetidinone- 
glucuronide. Compounds that are substituted 2-azetidinone-glucuronides are those having the 
following structure (I): 
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OH 




(i) 

wherein X represents a group that links the glucuronide to the 4-phenyl ring, for example but not 
limited to -0- or -C1.3 alkyl-, X represents an optionally substituted -alkanediyl-, and wherein 
any of the phenyl groups may be optionally substituted. Examples of the phenyl-X 2 -moiety in 
structure (I) include those represented at the 4-position on the 2-azetidinone structure shown 
below in structure (II). Additional examples of substituted 2-azetidinone-glucuronides include 
but are not limited to those described in U.S. Patent No. 5,756,470, WO02/066464 and US 
2002/0137689. Additional examples of substituted 2-azetidinone-glucuronide compounds 
include those having the structure (II) and pharmaceutically acceptable salts and esters thereof as 
follows: 
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R R 2 
Ap ^-(X)Fr(C)q-(Y)n-(C) r -(Z) | 
R 1 R 3 



-r- R' 



,9 




(II) 



wherein: 

Arl is selected from the group consisting of aryl and R.4 -substituted aryl; 
X, Y and Z are independently selected from the group consisting of -CH2-, -CH(Ci-6alkyl)- and 
-C(Ci- 6 alkyl)2-; 

R is selected from the group consisting of -OR6, -0(CO)R 6 , -0(CO)OR 9 , 

-0(CO)NR 6 R 7 , a sugar residue, a disugar residue, a trisugar residue and a tetrasugar residue; 
Rl is selected from the group consisting of -H, -Ci-6alkyl and aryl, or R and Rl together 



R2 is selected from the group consisting of -OR6 -0(CO)R 6 , -0(CO)OR 9 and -0(CO)NR 5 R 7 ; 



are oxo; 



R3 is selected from the group consisting of -H, -Ci-6alkyl and aryl or R2 and R3 together are 



oxo; 



q, r and t are each independently selected from 0 and 1 ; 



m, n and p are each independently selected from 0, 1, 2, 3 and 4; 
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R.4 is 1-5 substituents independently selected at each occurrence from the group consisting of: 
-OR5, -0(CO)R5, -0(CO)OR8, -0-Ci_5alkyl-OR5, -0(CO)NR5r6, -NR5r6, -NR5(C0)R6, 
-NR5(CO)OR8, -NR5(CO)NR6r7, -NR5S02R8, -COOR5, -CONR5R6, -COR5, 
-S02NR5R6, -S(0) t R8 -O-Ci-i0alkyl-COOR5, -0-Ci-ioalkyl-CONR5R6 and fluoro; 

R5, R6 and R7 are independently selected at each occurrence from the group consisting of -H, 
Ci -6alkyl, aryl and aryl-substituted Ci-6alkyl; 

R8 is independently selected from the group consisting of Ci -6alkyl, aryl and aryl-substituted 
Ci_6alkyl; 

R9 is selected from the group consisting of -C=C-CH2-NR10r1 1, -OC-C(0)Rl3, 
and-(CH2)3-NRlORl4 ; 

RlO is independently selected at each occurrence from -H and -Ci -3alkyl; 

Rl 1 is selected from the group consisting of -H, - Ci -3alkyl, -C(0)-Ci-3alkyl, -C(O)- 

NRIOrIO, -S02-Ci_3alkyl,and-S02-phenyl; and 

Rl2is selected from 

QH OH 





C0 2 H 



and 



C0 2 Me 



(referred to herein as "glucuronide") 



(referred to herein as "methyl ester glucuronide"); 



Rl3 is selected from the group consisting of -OH and -NRIOrI 1; and 
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R.14 is selected from the group consisting of -C(0)-Ci-3alkyl, -C(O)-NR10r10 5 
-S02-Ci-3alkyl and -S02-phenyl. 

[0122] The above compound (II) is described in U.S. Provisional Application No. , 

filed Dec 13,2003. 

[0123] Two additional types of screening systems that can be used include a labeled-ligand 
binding assay (e.g., direct binding assay or scintillation proximity assay (SPA)) and a "sterol 
(e.g., cholesterol) or 5a-stanol uptake" assay. A labeled ligand, for use in the binding assay, can 
be obtained by labeling a sterol (e.g., cholesterol) or a 5<x-stanol or a known NPC1L1 agonist or 
antagonist with a measurable group (e.g., 35 S, 125 I or 3 H). Various labeled forms of sterols (e.g., 
cholesterol) or 5a-stanols are available commercially or can be generated using standard 
techniques (e.g., Cholesterol- [1,2- 3 H(N)], Cholesterol-[l,2,6,7- 3 H(N)] or Cholesterol-[7- 3 H(N)]; 
American Radiolabeled Chemicals, Inc; St. Louis, MO). In a preferred embodiment, ezetimibe 
is fluorescently labeled with a BODIPY group (Altmann, et al, (2002) Biochim. Biophys. Acta 
1580(1): 77-93) or labeled with a detectable group such as 35 S, 125 I or 3 H. 

[0124] Direct Binding Assay. Typically, a given amount of NPC1L1 of the invention (e.g., 
SEQ ID NO: 2, 4 or 12) or a complex including NPC1L1 is contacted with increasing amounts of 
labeled ligand or known antagonist or agonist (discussed above) and the amount of the bound, 
labeled ligand or known antagonist or agonist is measured after removing unbound, labeled 
ligand or known antagonist or agonist by washing. As the amount of the labeled ligand or 
known agonist or antagonist is increased, a point is eventually reached at which all receptor 
binding sites are occupied or saturated. Specific receptor binding of the labeled ligand or known 
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agonist or antagonist is abolished by a large excess of unlabeled ligand or known agonist or 
antagonist. 

[0125] Preferably, an assay system is used in which non-specific binding of the labeled 
ligand or known antagonist or agonist to the receptor is minimal. Non-specific binding is 
typically less than 50%, preferably less than 15%, and more preferably less than 10% of the total 
binding of the labeled ligand or known antagonist or agonist. 

[0126] In the basic binding assay, the method for identifying an NPC1L1 ligand, agonist or 
antagonist includes: 

(a) contacting NPC 1 LI (e.g. , SEQ U) NO: 2 or 4 or 1 2), a fragment thereof or 
a complex including NPC 1 LI, in the presence of a known amount of labeled sterol (e.g., 
cholesterol) or 5a-stanol or known antagonist or agonist (e.g., labeled ezetimibe) with a sample 
to be tested for the presence of an NPC 1 LI ligand, agonist or antagonist; and 

(b) measuring the amount of labeled sterol (e.g. , cholesterol) or 5a-stanol or 
known antagonist or agonist directly or indirectly bound to NPC1L1. 

[0127] An NPC1 LI ligand in the sample is identified by measuring substantially reduced 
direct or indirect binding of the labeled sterol (e.g., cholesterol) or 5a-stanol or known antagonist 
or agonist to NPC 1 LI, compared to what would be measured in the absence of such a ligand. 
For example, reduced direct or indirect binding between [ 3 H] -cholesterol and NPC1L1 in the 
presence of a sample might suggest that the sample contains a substance which is competing 
against [ 3 H] -cholesterol for NPC 1 LI binding. 
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[0128] This assay can include a control experiment lacking any NPC1 LI -dependent ligand 
{e.g., sterol such as cholesterol or 5a-stanol) binding. In this assay, for example, a whole cell or 
cell membrane lacking any functional NPC1L1, for example, a cell or membrane isolated or 
derived from a transgenic mutant npcllV mouse of the invention, is assayed for ligand binding. 
When screening a sample for the presence of an NPC1L1 antagonist, it is useful to compare the 
level of binding observed in the presence of a sample being tested with that of a control 
experiment, as described herein, which completely lacks NPC1 LI -dependent binding. Ideally, 
though by no means necessarily, the level of binding seen in the presence of a sample containing 
an antagonist will be similar to that of the control experiment. 

[0129] Alternatively, a sample can be tested directly for binding to NPC1L1 (e.g., SEQ ID 
NO: 2, 4 or 12). A basic assay of this type may include the following steps: 

(a) contacting NPC1L1 (e.g., SEQ ID NO: 2 or 4 or 12), a fragment thereof or 
a complex including NPC1L1 with a labeled candidate compound (e.g., [ 3 H]-ezetimibe); and 

(b) detecting direct or indirect binding between the labeled candidate 
compound and NPC1L1. 

[0130] Again, these experiment can be performed along with a control experiment wherein 
NPC1 LI -dependent binding is completely lacking. For example, the assay can be performed 
using a whole cell or cell membrane lacking any functional NPC1L1 (e.g., cell or cell membrane 
derived from a transgenic, mutant npcllV mouse as described herein). 
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[0131] A candidate compound which is found to bind to NPC1L1 may function as ligand, 
agonist or antagonist of NPC1L1 (e.g., by inhibition of sterol (e.g., cholesterol) or 5a-stanol 
uptake). 

[0132] SPA Assay. NPC1L1 ligands may also be measured using scintillation proximity 
assays (SPA). SPA assays are conventional and very well known in the art; see, for example, 
U.S. Patent No. 4,568,649. In SPA, the target of interest is immobilized to a small microsphere 
approximately 5 microns in diameter. The microsphere, typically, includes a solid scintillant 
core which has been coated with a polyhydroxy film, which in turn contains coupling molecules, 
which allow generic links for assay design. When a radioisotopically labeled molecule binds to 
the microsphere, the radioisotope is brought into close proximity to the scintillant and effective 
energy transfer from electrons emitted by the isotope will take place resulting in the emission of 
light. While the radioisotope remains in free solution, it is too distant from the scintillant and the 
electron will dissipate the energy into the aqueous medium and therefore remain undetected. 
Scintillation may be detected with a scintillation counter. In general, 3 H and 125 I labels are well 
suited to SPA. 

[0133] For the assay of receptor-mediated binding events, the lectin wheat germ agglutinin 
(WGA) may be used as the SPA bead coupling molecule (Amersham Biosciences; Piscataway, 
NJ). The WGA coupled bead captures glycosylated, cellular membranes and glycoproteins and 
has been used for a wide variety of receptor sources and cultured cell membranes. The receptor 
is immobilized onto the WGA-SPA bead and a signal is generated on binding of an isotopically 
labeled ligand. Other coupling molecules which may be useful for receptor binding SPA assays 
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include poly-L-lysine and WGA/polyethyleneimine (Amersham Biosciences; Piscataway, NJ). 
See, for example, Berry, J.A., et al., (1991) Cardiovascular Pharmacol. 17 (Suppl.7): S143-S145; 
Hoffman, R., et al, (1992) Anal. Biochem. 203: 70-75; Kienhus, et al, (1992) J. Receptor 
Research 12: 389-399; Jing, S., et al, (1992) Neuron 9: 1067-1079. 

[0134] The scintillant contained in SPA beads may include, for example, yttrium silicate 
(YSi), yttrium oxide (YOx), diphenyloxazole or polyvinyltoluene (PVT) which acts as a solid 
solvent for diphenylanthracine (DP A). 

[0135] SPA assays may be used to analyze whether a sample contains an NPC1 LI ligand. In 
these assays, a host cell which expresses NPC1L1 {e.g., SEQ ID NO: 2 or 4 or 12) on the cell 
surface or a membrane fraction thereof is incubated with and captured by SPA beads {e.g., WGA 
coated YOx beads or WGA coated YSi beads). The beads bearing the NPC1L1 are incubated 
with labeled, known ligand or agonist or antagonist {e.g., 3 H-cholesterol, 3 H-ezetimibe or 125 I- 
ezetimibe). The assay mixture further includes either the sample to be tested or a blank {e.g., 
water). After an optional incubation, scintillation is measured using a scintillation counter. An 
NPC1L1 ligand, agonist or antagonist may be identified in the sample by measuring substantially 
reduced fluorescence, compared to what would be measured in the absence of such ligand, 
agonist or antagonist (blank). Measuring substantially reduced fluorescence may suggest that the 
sample contains a substance which competes for direct or indirect NPC1L1 binding with the 
known ligand, agonist or antagonist. 

[0136] Alternatively, a sample may be identified as an ligand of NPC1L1 by directly 
detecting binding in a SPA assay. In this assay, a labeled version of a candidate compound to be 
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tested may be put in contact with the host cell expressing NPC1L1 or a membrane fraction 
thereof which is bound to the SPA bead. Fluorescence may then be assayed to detect the 
presence of a complex between the labeled candidate compound and the host cell or membrane 
fraction expressing NPC1L1 or a complex including NPC1L1. A candidate compound which 
binds directly or indirectly to NPC1L1 may possess NPC1L1 agonistic or antagonistic activity. 

[0137] SPA Assays can also be performed along with a control experiment lacking any 
NPC1 LI -dependent binding. The control experiment can be performed, for example, with a cell 
or cell membrane lacking any functional NPC1L1 (e.g., cell or cell membrane derived from a 
transgenic, mutant npclll- mouse as described herein). When the control experiment is 
performed, the level of binding observed in the presence of sample being tested for the presence 
of an antagonist can be compared with that observed in the control experiment. 

[0138] Sterol/5a-stanol Uptake Assay. Assays may also be performed to determine if a 
sample can agonize or antagonize NPC1L1 mediated sterol (e.g., cholesterol) or 5a-stanol 
uptake. In these assays, a host cell expressing NPC1L1 (e.g., SEQ ID NO: 2 or 4 or 12) on the 
cell surface (discussed above) can be contacted with detectably labeled sterol (e.g., 3 H- 
cholesterol or 125 I-cholesterol)) or 5a-stanol along with either a sample or a blank. After an 
optional incubation, the cells can be washed to remove unabsorbed sterol or 5a-stanol. Sterol or 
5a-stanol uptake can be determined by detecting the presence of labeled sterol or 5a-stanol in 
the host cells. For example, assayed cells or lysates or fractions thereof (e.g., fractions resolved 
by thin-layer chromatography) can be contacted with a liquid scintillant and scintillation can be 
measured using a scintillation counter. 



NY02:47 1952.1 



-49- 



P36134 074669.0105 



[0139] In these assays, an NPC1L1 antagonist in the sample may be identified by measuring 
substantially reduced uptake of labeled sterol (e.g., H-cholesterol) or 5a-stanol, compared to 
what would be measured in the absence of such an antagonist and an agonist may be identified 
by measuring substantially increased uptake of labeled sterol (e.g., 3 H-cholesterol) or 5oc-stanol, 
compared to what would be measured in the absence of such an agonist. 

[0140] Uptake assays can also be performed along with a control experiment lacking any 
NPCl LI -dependent uptake. The control experiment can be performed, for example, with a cell 
lacking any functional NPC1L1 (e.g., cell derived from a transgenic, mutant npclll' mouse as 
described herein). When the control experiment is performed, the level of uptake observed in the 
presence of sample being tested for the presence of an antagonist can be compared with that 
observed in the control experiment. 

[0141] Source ofNPClLl. In principle, a binding assay of the invention could be 

carried out using a soluble NPC1L1 polypeptide of the invention, e.g., following production and 
refolding by standard methods from an E. coli or other prokaryotic or eukaryotic expression 
system, and the resulting receptor-labeled ligand complex could be precipitated, e.g., using an 
antibody against the receptor. The precipitate could then be washed and the amount of the 
bound, labeled ligand or antagonist or agonist could be measured. 

[0142] Alternatively, a nucleic acid encoding an NPCl LI polypeptide of the invention (e.g., 
SEQ ID NO: 2, 4 or 12) can be transfected into an appropriate host cell, whereby the NPC1L1 
will become incorporated into the membrane of the cell. A membrane fraction can then be 
isolated from the cell and used as a source of NPC1L1 for assay. Alternatively, the whole cell 
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expressing NPC1L1 in the cell surface can be used in an assay. Preferably, specific binding of 
the labeled ligand or known antagonist or agonist to an untransfected/untransformed host cell or 
to a membrane fraction from an untransfected/untransformed host cell will be negligible. 

[0143] Various membranes may be used directly as a source of NPC1L1 for the above- 
described screening systems, e.g. direct binding, scintillation proximity assay, sterol/5a stanol 
uptake assay. As described in Examples 5, 6, 7, 8, 9, 17, 27, and 29, NPC1L1 is highly 
expressed in certain tissues, especially in brush border cells of intestinal tissues. Therefore, brush 
border membrane vesical preparations may be utilized as a source of NPC1L1 . The membranes 
may be derived from mammalian intestinal tissue from rhesus, rat, mouse or human tissue. 

[0144] Membranes may be derived from brush border cells of intestinal tissues. Such 
membranes are conventionally prepared by collecting intestinal tissue from freshly sacrificed 
animals. The mucosa of the tissue is scraped, collected into buffered solutions, and 
homogenized. Cellular debris is removed and the membrane fractions are collected by 
centrifugation. Conventional techniques known to one of skill in the art maybe used for the 
preparation of brush border membrane vesicules. See Hauser, H., Howell, K., Dawson, R.M.C., 
Bowyer, D. E. Biochim. Biophys. Acta 602, 567-577 (1980); Kramer, W., Girbig, F., Gutjahr, 
U., Kowalewski, S., Jouvenal, K., Muller, G., Tripier, D., Wess, G. J. Biol. Chem. 268, 18035- 
18046 (1993); Rigtrup, K.M., Ong, D.E. Biochemistry 3_L, 2920-2926 (1992). 

[0145] The membrane preparation may be in vesicular or non-vesicular form. 
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[0146] Alternatively, liposomes and liposomal preparations comprising NPC1L1 may also 
be a viable source of NPC1L1 for the screening assays of the present claimed method. 

[0147] In vitro cultured cells expressing NPC1L1 may also be used. The host cells may be 
prepared by transforming or transfecting a nucleic acid encoding an NPC1L1 of the invention 
into an appropriate host cell, whereby the receptor becomes incorporated into the membrane of 
the cell. A membrane fraction can then be isolated from the cell and used as a source of the 
receptor for assay. Alternatively, the whole cell expressing the receptor on the cell surface can 
be used in an assay. Preferably, specific binding of the labeled ligand or known antagonist or 
agonist to an untransfected/untransformed host cell or membrane fraction from an 
untransfected/untransformed host cell will be negligible. 

[0148] Preferred host cells include Chinese Hamster Ovary (CHO) cells, murine macrophage 
J774 cells or any other macrophage cell line and human intestinal epithelial Caco2 cells. 

[0149] The present invention provides for a method of identifying a ligand of NPC1L1 using 
these membrane preparations, for example by contacting membranes comprising NPC1L1, such 
as brush border membrane vesicle preparations, with detectably labeled substituted azetidinone 
compounds which are known NPC1L1 ligands, agonists or antagonists, and a candidate 
compound and determining whether the candidate compound can bind to NPC1L1. The binding 
of the candidate compound to NPC1L1 may modulate binding of the detectably labeled NPC1L1 
ligands, agonists or antagonists to NPC1L1. In addition, a NPC1L1 ligand may be identified by 
measuring the binding of NPC1L1 with detectably labeled NPC1L1 ligands, agonists or 
antagonists in the presence and absence of the candidate compound wherein decreased binding 
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of the detectably labeled NPC1L1 ligands, agonists or antagonists to NPC1L1 is an indication 
that the candidate compound is ligand of NPC1L1. 

[0150] NPC1L1 may also be obtained by solubilization of membrane fractions comprising 
NPC1L1. The membranes may be obtained as discussed above, e.g., from mammalian tissue or 
in vitro cultured cells. 

[0151] Mouse Assay. The present invention comprises a mutant, transgenic mouse which 
lacks any functional NPC1L1. This mouse may serve as a convenient control experiment in 
screening assays for identifying inhibitors of intestinal sterol (e.g., cholesterol) or 5a-stanol 
absorption, preferably inhibitors of NPC1L1. Preferably, a mouse assay of the present invention 
would comprise the following steps: 

(a) feeding a sterol (e.g., cholesterol) or 5a-stanol-containing substance (e.g., 
comprising radiolabeled cholesterol, such as I4 C-cholesterol or 3 H-cholesterol) to a first and 
second mouse comprising a functional NPC1L1 gene and to a third, mutant mouse lacking a 
functional NPC1L1; 

[0152] The sterol (e.g., cholesterol) or 5a-stanol containing substance preferably contains 
labeled cholesterol, such as a radiolabeled cholesterol, for example, 3 H or 14 C labeled 
cholesterol. The sterol (e.g., cholesterol) or 5a-stanol containing substance may also include 
cold, unlabeled sterol (e.g., cholesterol) or 5a-stanol such as in corn oil. 

[0153] In these assays, the third npclll' mutant mouse serves as a (+)-control experiment 
which exhibits low levels of intestinal sterol (e.g., cholesterol) or 5oc-stanol absorption and the 
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second mouse serves as a (-)-control experiment which exhibits normal, uninhibited levels of 
intestinal sterol (e.g., cholesterol) or 5a-stanol absorption. The second mouse is not 
administered the sample to be tested for an NPC1L1 antagonist. The first mouse is the 
experiment. 

(b) administering the sample to the first mouse comprising a functional 
NPC1L1 but not to the second mouse; 

(c) measuring the amount of sterol {e.g., cholesterol) or 5a-stanol absorption 
in the intestine of said first, second and third mouse; 

[0154] Intestinal sterol (e.g., cholesterol) or 5<x-stanol absorption may be measured by any 
method known in the art. For example, the level intestinal absorption can be assayed by 
measuring the level of serum sterol (e.g., cholesterol) or 5a-stanol. 

(d) comparing the levels of intestinal sterol (e.g. , cholesterol) or 5ct-stanol 
absorption in each mouse; 

wherein the sample is determined to contain the intestinal sterol (e.g., 
cholesterol) or 5a-stanol absorption antagonist when the level of intestinal sterol (e.g., 
cholesterol) or 5a-stanol absorption in the first mouse and in the third mouse are less than the 
amount of intestinal sterol (e.g., cholesterol) or 5a-stanol absorption in the second mouse. 
[0155] Preferably, if the sample contains an intestinal sterol (e.g., cholesterol) or 5oc-stanol 
absorption inhibitor (e.g., an NPC1L1 inhibitor), the level of sterol (e.g., cholesterol) or 5a- 
stanol absorption in the first mouse will be similar to that of the third, npclll mutant mouse. 
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[0156] An alternative, (+)-control experiment which may be used in these screening assays is 
a mouse comprising functional NPC1L1 which is administered a known antagonist of NPC1L1, 
such as ezetimibe. 

Pharmaceutical Compositions 
[0157] NPC1L1 ligands discovered, for example, by the screening methods described above 
may be used therapeutically (e.g., in a pharmaceutical composition) to stimulate or block the 
activity of NPC 1 L 1 and, thereby, to treat any medical condition caused or mediated by NPC 1 L 1 . 
In addition, the antibody molecules of the invention may also be used therapeutically (e.g., in a 
pharmaceutical composition) to bind NPC 1 LI and, thereby, block the ability of NPC 1 LI to bind 
a sterol (e.g., cholesterol) or 5a-stanol. Blocking the binding of a sterol (e.g., cholesterol) or 5a- 
stanol would prevent absorption of the molecule (e.g., by intestinal cells such as enterocytes). 
Blocking absorption of sterol (e.g., cholesterol) or 5ct-stanol would be a useful way to lower 
serum sterol (e.g., cholesterol) or 5a-stanol levels in a subject and, thereby, reduce the incidence 
of, for example, hyperlipidemia, atherosclerosis, coronary heart disease, stroke or 
arteriosclerosis. 

[0158] The term "subject" or "patient" includes any organism, preferably animals, more 
preferably mammals (e.g., mice, rats, rabbits, dogs, horses, primates, cats) and most preferably 
humans. 

[0159] The term "pharmaceutical composition" refers to a composition including an active 
ingredient and a pharmaceutically acceptable carrier and/or adjuvant. 
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[0160] Although the compositions of this invention could be administered in simple solution, 
they are more typically used in combination with other materials such as carriers, preferably 
pharmaceutically acceptable carriers. Useful, pharmaceutically acceptable carriers can be any 
compatible, non-toxic substances suitable for delivering the compositions of the invention to a 
subject. Sterile water, alcohol, fats, waxes, and inert solids may be included in a 
pharmaceutically acceptable carrier. Pharmaceutically acceptable adjuvants (buffering agents, 
dispersing agents) may also be incorporated into the pharmaceutical composition. 

[0161] Preferably, the pharmaceutical compositions of the invention are in the form of a pill 
or capsule. Methods for formulating pills and capsules are very well known in the art. For 
example, for oral administration in the form of tablets or capsules, the active drug component 
may be combined with any oral, non-toxic pharmaceutically acceptable inert carrier, such as 
lactose, starch, sucrose, cellulose, magnesium stearate, dicalcium phosphate, calcium sulfate, 
talc, mannitol, ethyl alcohol (liquid forms) and the like. Moreover, when desired or needed, 
suitable binders, lubricants, disintegrating agents and coloring agents may also be incorporated in 
the mixture. Suitable binders include starch, gelatin, natural sugars, corn sweeteners, natural and 
synthetic gums such as acacia, sodium alginate, carboxymethylcellulose, polyethylene glycol and 
waxes. Among the lubricants there may be mentioned for use in these dosage forms, boric acid, 
sodium benzoate, sodium acetate, sodium chloride, and the like. Disintegrants include starch, 
methylcellulose, guar gum and the like. Sweetening and flavoring agents and preservatives may 
also be included where appropriate. 
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[0162] The pharmaceutical compositions of the invention may be administered in 
conjunction with a second pharmaceutical composition or substance. In preferred embodiments, 
the second composition includes a cholesterol-lowering drug. When a combination therapy is 
used, both compositions may be formulated into a single composition for simultaneous delivery 
or formulated separately into two or more compositions (e.g., a kit). 

[0163] The formulations may conveniently be presented in unit dosage form and may be 
prepared by any methods well known in the art of pharmacy. See, e.g., Gilman et al. (eds.) 
(1990), The Pharmacological Bases of Therapeutics, 8th Ed., Pergamon Press; and Remington's 
Pharmaceutical Sciences, supra, Easton, Penn.; Avis et al. (eds.) (1993) Pharmaceutical Dosage 
Forms: Parenteral Medications Dekker, New York; Lieberman et al. (eds.) (1990) 
Pharmaceutical Dosage Forms: Tablets Dekker, New York; and Lieberman et al. (eds.) (1990), 
Pharmaceutical Dosage Forms: Disperse Systems Dekker, New York. 

[0164] The dosage regimen involved in a therapeutic application may be determined by a 
physician, considering various factors which may modify the action of the therapeutic substance, 
e.g., the condition, body weight, sex and diet of the patient, the severity of any infection, time of 
administration, and other clinical factors. Often, treatment dosages are titrated upward from a 
low level to optimize safety and efficacy. Dosages may be adjusted to account for the smaller 
molecular sizes and possibly decreased half-lives (clearance times) following administration. 

[0165] An "effective amount" of a ligand of the invention may be an amount that will 
detectably reduce the level of intestinal sterol (e.g., cholesterol) or 5a-stanol absorption or 
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detectably reduce the level of serum sterol (e.g., cholesterol) or 5oc-stanol in a subject 
administered the composition. 

[0166] Typical protocols for the therapeutic administration of such substances are well 
known in the art. Pharmaceutical composition of the invention may be administered, for 
example, by any parenteral or non-parenteral route. 

[0167] Pills and capsules of the invention can be administered orally. Injectable 
compositions can be administered with medical devices known in the art; for example, by 
injection with a hypodermic needle. 

[0168] Injectable pharmaceutical compositions of the invention may also be administered 
with a needleless hypodermic injection device; such as the devices disclosed in U.S. Patent Nos. 
5,399,163; 5,383,851; 5,312,335; 5,064,413; 4,941,880; 4,790,824 or 4,596,556. 

Anti-Sense 

[0169] The present invention also encompasses anti-sense oligonucleotides capable of 
specifically hybridizing to mRNA encoding NPC1L1 (e.g., any of SEQ ID NOs: 1, 3, 5-1 1 or 13) 
having an amino acid sequence defined by, for example, SEQ ID NO: 2 or 4 or 12 or a 
subsequence thereof so as to prevent translation of the mRNA. Additionally, this invention 
contemplates anti-sense oligonucleotides capable of specifically hybridizing to the genomic 
DNA molecule encoding NPC1L1, for example, having an amino acid sequence defined by SEQ 
ID NO: 2 or 4 or 12 or a subsequence thereof. 
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[0170] This invention further provides pharmaceutical compositions comprising (a) an 
amount of an oligonucleotide effective to reduce NPC1 LI -mediated sterol (e.g., cholesterol) or 
5a-stanol absorption by passing through a cell membrane and binding specifically with mRNA 
encoding NPC1L1 in the cell so as to prevent its translation and (b) a pharmaceutically 
acceptable carrier capable of passing through a cell membrane. In an embodiment, the 
oligonucleotide is coupled to a substance that inactivates mRNA. In another embodiment, the 
substance that inactivates mRNA is a ribozyme. 

[0171] Reducing the level of NPC1L1 expression by introducing anti-sense NPC1L1 RNA 
into the cells of a patient is a useful method reducing intestinal sterol (e.g., cholesterol) or 5 - 
stanol absorption and serum cholesterol in the patient. 

Kits 

[0172] Kits of the present invention include ezetimibe, preferably combined with a 
pharmaceutically acceptable carrier, in a pharmaceutical formulation, more preferably in a 
pharmaceutical dosage form such as a pill, a powder, an injectable liquid, a tablet, dispersible 
granules, a capsule, a cachet or a suppository. See for example, Gilman et al. (eds.) (1990), The 
Pharmacological Bases of Therapeutics, 8th Ed., Pergamon Press; and Remington's 
Pharmaceutical Sciences, supra, Easton, Penn.; Avis etal. (eds.) (1993) Pharmaceutical Dosage 
Forms: Parenteral Medications Dekker, New York; Lieberman et al. (eds.) (1990) 
Pharmaceutical Dosage Forms: Tablets Dekker, New York; and Lieberman et al. (eds.) (1990), 
Pharmaceutical Dosage Forms: Disperse Systems Dekker, New York. Preferably, the dosage 
form is a Zetia® tablet (Merck/Schering-Plough Corp.). Ezetimibe may be supplied in any 
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convenient form. For example, tablets including ezetimibe may be supplied in bottles of 30, 90 
or 500. 

[0173] The kits of the present invention also include information, for example in the form of 
a package insert, indicating that the target of ezetimibe is NPC1L1 (NPC3). The term "target of 
ezetimibe" indicates that ezetimibe reduces intestinal sterol (e.g., cholesterol) or 5a-stanol 
absorption, either directly or indirectly, by antagonizing NPC1L1 . The form of the insert may 
take any form, such as paper or on electronic media such as a magnetically recorded medium 
(e.g., floppy disk) or a CD-ROM. 

[0174] The package insert may also include other information concerning the pharmaceutical 
compositions and dosage forms in the kit. Generally, such information aids patients and 
physicians in using the enclosed pharmaceutical compositions and dosage forms effectively and 
safely. For example, the following information regarding ezetimibe (e.g., Zetia®) and/or 
simvastatin (e.g., Zocor®) may be supplied in the insert: pharmacokinetics, pharmacodynamics, 
clinical studies, efficacy parameters, indications and usage, contraindications, warnings, 
precautions, adverse reactions, overdosage, proper dosage and administration, how supplied, 
proper storage conditions, references and patent information. 

[0175] The kits of the invention may also include simvastatin ( 
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) preferably combined with a pharmaceutically acceptable carrier, in a pharmaceutical 
formulation, more preferably in a pharmaceutical dosage form such as a pill, a powder, an 
injectable liquid, a tablet, dispersible granules, a capsule, a cachet or a suppository. Preferably, 
the dosage form of simvastatin is a Zocor® tablet (Merck & Co.; Whitehouse Station, NJ). 
[0099] Tablets or pills comprising simvastatin may be supplied in any convenient form. For 
example, pills or tablets comprising 5mg simvastatin can be supplied as follows: bottles of 30, 
60, 90, 100 or 1000. Pills or tablets comprising 10 mg simvastatin may be supplied as follows: 
bottles of 30, 60, 90, 100, 1000 or 10,000. Pills or tablets comprising 20 mg simvastatin may be 
supplied as follows: bottles of 30, 60, 90, 100, 1000 or 10,000. Pills or tablets comprising 40 mg 
simvastatin may be supplied as follows: bottles of 30, 60, 90, 100 or 1000. Pills or tablets 
comprising 80 mg simvastatin may be supplied as follows: bottles of 30, 60, 90, 100, 1000 or 
10,000. 

[0176] Ezetimibe and simvastatin may be supplied, in the kit, as separate compositions or 
combined into a single composition. For example, ezetimibe and simvastatin may be supplied 
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within a single, common pharmaceutical dosage form {e.g., pill or tablet) as in separate 
pharmaceutical dosage forms {e.g., two separate pills or tablets). 

npclir Cells 

[0177] The present invention provides any isolated mammalian cell, {e.g., an isolated mouse 
cell, an isolated rat cell or an isolated human cell) which lacks an NPC1L1 gene which encodes 
or can produce a functional NPC1L1 protein. Included within this embodiment are mutant 
npclll genes comprising a point mutation, truncation or deletion of the genetic coding region or 
of any regulatory element {e.g., a promoter). 

[0178] For example, the cell can be isolated from a mutant mouse comprising a 
homozygous mutation of endogenous, chromosomal NPC1L1 wherein the mouse does not 
produce any functional NPC1L1 protein {e.g., the mouse described below in Example 22). 
Moreover, the present invention comprises any cell, tissue, organ, fluid, nucleic acid, peptide or 
other biological substance derived or isolated from such a mutant mouse, particularly a mutant, 
transgenic mouse which does not produce any functional NPC1L1, wherein the region of 
endogenous, chromosomal NPC1L1 deleted, in the mouse, corresponds to nucleotides 790-998 
of the nucleotide sequence set forth in SEQ ID NO: 45. 

[0179] The isolated cell can be isolated or derived, for example, from the duodenum, gall 
bladder, liver, small intestine or stomach of the mutant mouse. Further, the cell can be an 
enterocyte. 

[0180] The npclir mutant cells are useful, for example, for use in control experiments in 
screening assays (see e.g., supra) since they lack any NPC1 LI -dependent uptake or binding of 
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sterol, 5ot-stanol or ezetimibe. The level of inhibition caused by a particular sample, in a 
screening assay, can be compared to that of an assay performed with the mutant cell. Ideally, 
though by no means necessarily, in a screening assay, for example, as described herein, the same 
amount of binding will be observed by a non-mutant cell or cell membrane, in the presence of an 
antagonist, as is observed in connection with a mutant npclll cell or cell membrane alone. 

EXAMPLES 

[0181] The following examples are provided to more clearly describe the present invention 
and should not be construed to limit the scope of the invention in any way. 

Example 1: Cloning and Expression of Rat, Mouse and Human NPC1L1. 
[00100] Rat NPC, mouse NPC1L1 or human NPC1L1 can all conveniently be amplified using 
polymerase chain reaction (PCR). In this approach, DNA from a rat, mouse or human cDNA 
library can be amplified using appropriate primers and standard PCR conditions. Design of 
primers and optimal amplification conditions constitute standard techniques which are 
commonly known in the art. 

[0182] An amplified NPC1L1 gene may conveniently be expressed, again, using methods 
which are commonly known in the art. For example, NPC 1 LI may be inserted into a pET-based 
plasmid vector (Stratagene; La Joola, CA), downstream of the T7 RNA polymerase promoter. 
The plasmid may then be transformed into a T7 expression system {e.g., BL21DE3 E.coli cells), 
grown in a liquid culture and induced {e.g., by adding IPTG to the bacterial culture). 
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Example 2: Direct Binding Assay. 

[0183] Membrane preparation : Caco2 cells transfected with an expression vector containing 
a polynucleotide encoding NPC1L1 (e.g., SEQ ID NO: 2, 4 or 12) are harvested by incubating in 
5 mM EDTA/phosphate-buffered saline followed by repeated pipeting. The cells are centrifuged 
5 min at 1000 x g. The EDTA/PBS is decanted and an equal volume of ice-cold 50mM Tris- 
HC1, pH 7.5 is added and cells are broken up with a Polytron (PT10 tip, setting 5, 30 sec). 
Nuclei and unbroken cells are sedimented at 1000 x g for 10 min and then the supernatant is 
centrifuged at 50,000 x g for 10 min. The supernatant is decanted, the pellet is resuspended by 
Polytron, a sample is taken for protein assay (bicinchoninic acid, Pierce), and the tissue is again 
centrifuged at 50,000 x g. Pellets are stored frozen at -20°C. 

[0184] Binding assay : For saturation binding, four concentrations of [ 3 H]-ezetimibe (15 
Ci/mmol) are incubated without and with 10" 5 M ezetimibe in triplicate with 50 |ag of membrane 
protein in a total volume of 200 ul of 50 mM Tris-HCl, pH 7.5, for 30 min at 30°C. Samples are 
filtered on GF/B filters and washed three times with 2 ml of cold Tris buffer. Filters are dried in 
a microwave oven, impregnated with Meltilex wax scintillant, and counted at 45% efficiency. 
For competition binding assays, five concentrations of a sample are incubated in triplicate with 
18 nM [ 3 H]-ezetimibe and 70 ug of membrane protein under the conditions described above. 
Curves are fit to the data with Prism (GraphPad Software) nonlinear least-squares curve-fitting 
program and Kj values are derived from IC 50 values according to Cheng and Prusoff (Cheng, Y. 
C, et al, (1973) Biochem. Pharmacol. 22: 3099-3108). 
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Example 3: SPA Assay. 

[0185] For each well of a 96 well plate, a reaction mixture of 10 ug human, mouse or rat 
NPC1L1-CHO overexpressing membranes (Biosignal) and 200 ug/well YSi-WGA-SPA beads 
(Amersham) in 100 ul is prepared in NPC1L1 assay buffer (25 mM HEPES, pH 7.8, 2 mM 
CaC12, ImM MgC12, 125 mM NaCl, 0.1% BSA). A 0.4 nM stock of ligand- [ 125 I]-ezetimibe- is 
prepared in the NPC1L1 assay buffer. The above solutions are added to a 96-well assay plate as 
follows: 50 ul NPC1L1 assay buffer, 100 ul of reaction mixture, 50 ul of ligand stock (final 
ligand concentration is 0.1 nM). The assay plates are shaken for 5 minutes on a plate shaker, 
then incubated for 8 hours before cpm/well are determined in Microbeta Trilux counter 
(PerkinElmer). 

[0186] These assays will indicate that [ 125 I]-ezetimibe binds to the cell membranes 
expressing human, mouse or rat NPC1L1. Similar results will be obtained if the same 
experiment is performed with radiolabeled cholesterol (e.g., l25 I-cholesterol). 

Example 4: Cholesterol Uptake Assay. 

[0187] CHO cells expressing either SR-B1 or three different clones of rat NPC1L1 or one 
clone of mouse NPC1L1 were starved overnight in cholesterol free media then dosed with [3H]- 
cholesterol in a mixed synthetic micelle emulsion for 4 min, 8 min, 12 min or 24 min in the 
absence or presence of 10 M ezetimibe. The cells were harvested and the lipids were 
organically extracted. The extracted lipids were spotted on thin-layer chromatography (TLC) 
plates and resolved within an organic vapor phase. The free cholesterol bands for each assay 
were isolated and counted in a scintillation counter. 
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[0188] The SR-B1 expressing cells exhibited an increase in [ 3 H]-cholesterol uptake as early 
as 4 min which was also inhibited by ezetimibe. The three rat clones and the one mouse clone 
appeared to give background levels of [ 3 H]-cholesterol uptake which was similar to that of the 
untransformed CHO cell. 

[0189] These experiments will yield data demonstrating that CHO cells can perform mouse, 
rat and human NPC1 LI -dependent uptake of [ 3 H]-cholesterol when more optimal experimental 
conditions are developed. 

Example 5: Expression of Rat NPC1L1 in Wistar Rat Tissue. 

[0190] In these experiments, the expression of rat NPC1L1 mRNA, in several rat tissues, 

was evaluated. The tissues evaluated were esophagus, stomach, duodenum, jejunum, ileum, 
proximal colon, distal colon, liver, pancreas, heart, aorta, spleen, lung, kidney, brain, muscle, 
testes, ovary, uterus, adrenal gland and thyroid gland. Total RNA samples were isolated from at 
least 3 male and 3 female animals and pooled. The samples were then subjected to real time 
quantitative PCR using Taqman analysis using standard dual-labeled fluorogenic oligonucleotide 
probes. Typical probe design incorporated a 5' reporter dye {e.g., 6FAM (6-carboxyfluorescein) 
or VIC) and a 3' quenching dye {e.g., TAMRA (6-carboxytetramethyl-rhodamine)). 

rdXNPClLl : 

[0191] Forward: TCTTCACCCTTGCTCTTTGC (SEQ ID NO: 14) 

[0192] Reverse: AATGATGGAGAGTAGGTTGAGGAT (SEQ ID NO: 15) 

[00101] Probe: [6FAM]TGCCCACCTTTGTTGTCTGCTACC[TAMRA] (SEQ ID NO: 16) 
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rat P-actin : 

[0193] Forward: ATCGCTGACAGGATGCAGAAG (SEQ ID NO: 17) 

[0194] Reverse: TCAGGAGGAGCAATGATCTTGA (SEQ ID NO: 1 8) 

[00102] Probe: [VIC]AGATTACTGCCCTGGCTCCTAGCACCAT[TAMRA] (SEQ ID NO: 
19) 

[0195] PCR reactions were run in 96-well format with 25 ul reaction mixture in each well 
containing: Platinum SuperMix (12.5 ul), ROX Reference Dye (0.5 ul), 50 mM magnesium 
chloride (2 ul), cDNA from RT reaction (0.2 ul). Multiplex reactions contained gene specific 
primers at 200 nM each and FAM labeled probe at 100 nM and gene specific primers at 100 nM 
each and VIC labeled probe at 50 nM. Reactions were run with a standard 2-step cycling 
program, 95°C for 15 sec and 60°C for 1 min, for 40 cycles. 

[0196] The highest levels of expression were observed in the duodenum, jejunum and ileum 
tissue. These data indicate that NPC1L1 plays a role in cholesterol absorption in the intestine. 

Example 6: Expression of Mouse NPC1L1 in Mouse Tissue. 

[0197] In these experiments, the expression of mouse NPC1L1 mRNA, in several tissues, 
was evaluated. The tissues evaluated were adrenal gland, BM, brain, heart, islets of langerhans, 
LI, small intestine, kidney, liver, lung, MLN, PLN, muscle, ovary, pituitary gland, placenta, 
Peyers Patch, skin, spleen, stomach, testes, thymus, thyroid gland, uterus and trachea. Total 
RNA samples were isolate from at least 3 male and 3 female animals and pooled. The samples 
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were then subjected to real time quantitative PCR using Taqman analysis using the following 
primers and probes: 

mouse NPC1L1: 

[0198] Forward: ATCCTCATCCTGGGCTTTGC (SEQ ID NO: 20) 

[0199] Reverse: GCAAGGTGATCAGGAGGTTGA (SEQ ID NO: 2 1) 

[0200] Probe: [6FAM]CCCAGCTTATCCAGATTTTCTTCTTCCGC[TAMRA] (SEQ ID 
NO: 22) 

[0201] The highest levels of expression were observed in the Peyer's Patch, small intestine, 
gall bladder and stomach tissue. These data are consistent with a cholesterol absorption role for 
NPC1L1 which takes place in the digestive system. 

Example 7: Expression of Human NPC1L1 in Human Tissue. 

[0202] In these experiments, the expression level of human NPC1L1 mRNA was evaluated 
in 2045 samples representing 46 normal tissues. Microarray-based gene expression analysis was 
performed on the Affymetrix HG-U95 GeneChip using a cRNA probe corresponding to base 
pairs 4192-51 17 (SEQ ID NO: 43) in strict accordance to Affymetrix's established protocols. 
Gene Chips were scanned under low photo multiplier tube (PMT), and data were normalized 
using either Affymetrix MAS 4.0 or MAS 5.0 algorithms. In addition "spike ins" for most 
samples were used to construct a standard curve and obtain RNA concentration values according 
Gene Logic algorithms and procedures. A summary of these results are indicated, below, in 
Table 2. 
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Table 2, Expression level of Mill mRNA in various human tissues, 









Lower 




Upper 










Lower 




Upper 


Tissue 


Present 


Absent 


25% 


Median 


75% 




Tissue 


Present 


Absent 


25% 


Median 


75% 


Adipose 


2 of 32 


30of 32 




1.16 


12.23 




Liver 


32 of 34 


2 of 34 


325.74 


427.77 


540.1 


Adrenal Gland 


Oof 12 


12 of 12 


•xm 


•4.47 


1041 




Lung 


2 of 93 


91 of 93 


•347 


11.03 


2244 


Appendix 


Oof3 


3 of 3 


•8.02 


•6.69 


38.19 




Lymph Node 


Oof 11 


11 of 11 


■1.78 


■0.19 


144 


Artery 


Oof3 


3 of 3 


•6,59 


•4.67 


9.68 




Muscles 


Oof 39 


39 of 39 


•21.57 


8.25 


26.73 


Bladder 


lof5 


4ofS 


•22 


•745 


•1.99 




Myometrium 


8 of 106 


98 of 106 


•3.98 


447 


1745 


Bone 


Oof3 


3 of 3 


•1,(4 


3.3 


19.53 




Omentum 


Oof 15 


15 of 15 


-14.25 


•1.6 


1948 


Breast 


4 of 80 


76 of 80 


•4.07 


3,13 


14.67 




Ovary 


lof 74 


73 of 74 


0.5 


17,51 


38.28 


Cerebellum 


Oof 5 


SofS 


•344 


3.24 


1S48 




Pancreas 


Oof 34 


34 of 34 


•87.08 


-53.2 


•24.14 


Cervix 


3 of 101 


98 of 101 


•746 


•047 


20.89 




Placenta 


0of5 


5of 5 


•20.4 


•3.44 


18.91 


Colon 


9 of 151 


142 of 151 


•10.19 


0J1 


1846 




Prostate 


Oof 32 


32of32 


148 


15,56 


27.24 


Cortex frontal Lobe 


0of7 


7of 7 


1.4 


846 


11.75 




Rectum 


lof 43 


42of43 


•9.26 


■1.49 


. 9.8 


Cortex Temporal Lobs 


Oof 3 


3of3 


7.1 


8.5 


1547 




Right Atrium 


4 of 169 


165 of 169 


•19,32 


•6.58 


7.72 


Duodenum 


59 of 61 


;' 2of 61 


519.23 


827.43 


: 1101.67 




RJghtVenWcIa 


lof 160 


159 of 160 


•24.01 


-6.49 


10.06 


Endometrium 


Oof 21 


21 of 21 


•14,43 


-639 


2,79 




Skin 


Oof 59 


59 of 59 


•12.68 


14 


22.77 


Esophagus 


1 of 27 


26 of 27 


■10,93 


•4.97 


12,48 




Small Intestine 


4G of 68 


22 of 68 


21.21 


493.93 


939,2 


Fallopian Tube 


3 of 51 


48 of 51 


5,02 


13,24 


2M7 




Soft Tissual 


1 of 6 


5of6 


•1.99 


2.6 


5.32 




8 of 8 


Oof 8 


205.76 


• 273.39 


4228 




Spleen 


Oof 31 


31 of 31 


•9.41 


-0.31 


93 
















Heart 


Oof 3 


3 of 3 


3,33 


11,19 


11.66 




Stomach 


7 of 47 


40 of 47 


19.02 


52.29 


117,09 


Hippocampus 


Oof 5 


5of5 


8.25 


9.11 


1943 




Testis 


Oof S 


5of5 


■441 


1.22 


11,2 


Kidney 


4 of 86 


82 of 86 


•8.36 


3.41 


16.46 




Thymus 


lof 71 


70 of 71 


-6.26 


2.51 


11,67 


Larynx 


Oof 4 


4of4 


•13.76 


•0,81 


8.54 




Thyroid Gland 


lof 18 


17 of IS 


•12.22 


2.84 


1746 


Left Atrium 


2 of 141 


139 of 141 


•18.9 


•448 


644 




Uterus 


Oof 58 


58 of 58 


•10.67 


1.59 


16.01 


Left Ventricle 


Oof 15 


15 of 15 


•21,19 


•9.59 


17,7 




WBC 


3 of 40 


37 of 40 


•1645 


•0.72 


25.18 



[0203] Shaded data corresponds to tissues wherein the highest levels of NPC1L1 mRNA was 
detected. The "Present" column indicates the proportion of specified tissue samples evaluated 
wherein NPC1L1 mRNA was detected. The "Absent" column indicates the proportion of 
specified tissue samples evaluated wherein NPC1L1 RNA was not detected. The "lower 25%", 
"median" and "upper 75%" columns indicate statistical distribution of the relative NPC1L1 
signal intensities observed for each set of tissue evaluated. 
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Example 8: Distribution of Rat NPC1L1, Rat IBA T or Rat SR-B1 mRNA in Rat Small 
Intestine. 

[0204] In these experiments, the distribution of rat NPC1L1 mRNA along the proximal-distal 
axis of rat small intestines was evaluated. Intestines were isolated from five independent animals 
and divided into 10 sections of approximately equal length. Total RNA was isolated and 
analyzed, by real time quantitative PCR using Taqman analysis, for localized expression levels 
of rat NPC1L1, rat IB AT (ileal bile acid transporter) or rat SR-B1 mRNA. The primers and probes 
used in the analysis were: 

rat NPC1 LI : 

[0205] Forward: TCTTCACCCTTGCTCTTTGC (SEQ ID NO: 23) 

[0206] Reverse: AATGATGGAGAGTAGGTTGAGGAT (SEQ ID NO: 24) 

[0207] Probe: [6FAM]TGCCCACCTTTGTTGTCTGCTACC[TAMRA] (SEQ ID NO: 25) 

rat Villin : 

[0208] Forward: AGCACCTGTCCACTGAAGATTTC (SEQ ID NO: 26) 

[0209] Reverse: TGGACGCTGAGCTTCAGTTCT (SEQ ID NO: 27) 

[0210] Probe: [VIC]CTTCTCTGCGCTGCCTCGATGGAA[TAMRA] (SEQ ID NO: 28) 

ratSfl-Bl : 

[0211] Forward: AGTAAAAAGGGCTCGCAGGAT (SEQ ID NO: 29) 
[0212] Reverse: GGCAGCTGGTGACATCAGAGA (SEQ ID NO: 30) 
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[0213] Probe: [6FAM]AGGAGGCCATGCAGGCCTACTCTGA[TAMRA] (SEQ ID NO: 
31) 

rat IB AT : 

[0214] Forward: GAGTCCACGGTCAGTCCATGT (SEQ ID NO: 32) 

[0215] Reverse: TTATGAACAACAATGCCAAGCAA (SEQ ID NO: 33) 

[0216] Probe: [6FAM]AGTCCTTAGGTAGTGGCTTAGTCCCTGGAAGCTC[TAMRA] 
(SEQ ID NO: 34) 

[0217] The mRNA expression levels of each animal intestinal section were analyzed 
separately, then the observed expression level was normalized to the observed level of villin 
mRNA in that intestinal section. The observed, normalized mRNA expression levels for each 
section where then averaged. 

[0218] The expression level of NPC1L1 and SR-B1 were highest in the jejunum (sections 2- 
5) as compared to that of the more distal ileum sections. Since the jejunum is believed to be the 
site of cholesterol absorption, these data suggest such a role for rat NPC1L1 . IB A T distribution 
favoring the ileum is well document and served as a control for the experiment. 

Example 9: In situ Analysis of Rat NPC1L1 mRNA in Rat Jejunum Tissue. 

[0219] The localization of rat NPC1L1 mRNA was characterized by in situ hybridization 

analysis of rat jejunum serial sections. The probes used in this analysis were: 



NY02:471952.I 



-71- 



P36134 074669.0105 



[0220] T7-sense probe : GTAATACGACTCACTATAGGGCCCTGACGGTCCTTCCTGA 
GGGAATCTTCAC (SEQ ID NO: 35) 

[0221] T7-antisense probe : GTAATACGACTCACTATAGGGCCTGGGAAGTTGGTCAT 
GGCCACTCCAGC (SEQ ID NO: 36) 

[0222] The RNA probes were synthesized using T7 RNA polymerase amplification of a PCR 
amplified DNA fragment corresponding rat NPC1L1 nucleotides 3318 to 3672 (SEQ ED NO 1). 
Sense and anti-sense digoxigenin-UTP labeled cRNA probes were generated from the T7 
promoter using the DIG RNA Labeling Kit following the manufacturer's instructions. Serial 
cryosections rat jejunum were hybridized with the sense and antiisense probes. Digoxigenin 
labeling was detected with the DIG Nucleic Acid Detection Kit based on previous methods. A 
positive signal is characterized by the deposition of a red reaction product at the site of 
hybridization. 

[0223] The anti-sense probe showed strong staining of epithelium along the crypt-villus axis 
under low magnification (40X). The observed rat NPC1L1 mRNA expression levels may have 
been somewhat greater in the crypts than in the villus tips. Under high magnification (200X), 
staining was observed in the enterocytes but not in the goblet cells. A lack of staining observed 
with the sense probe (control) confirmed the high specificity of the NPC1L1 anti-sense signal. 
These data provided further evidence of the role of rat NPC1L1 in intestinal cholesterol 
absorption. 

Example 10: FACS Analysis of Fluorescently Labeled Ezetimibe Binding to Transiently 
Transfected CHO Cells. 
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[0224] In these experiments, the ability of BODIPY-labeled ezetimibe (Altmann, et al, 
(2002) Biochim. Biophys. Acta 1580(1): 77-93) to bind to NPC1L1 and SR-B1 was evaluated. 
"BODIPY" is a fluorescent group which was used to detect the BODIPY-ezetimibe. Chinese 
hamster ovary (CHO) cells were transiently transfected with rat NPC1L1 DNA 
(rNPClLl/CHO), mouse NPC1L1 DNA (mNPClLl/CHO), mouse SR-B1 DNA (mSRBI/CHO) 
or EGFP DNA (EGFP/CHO). EGFP is enhanced green fluorescent protein which was used as a 
positive control. The transfected CHO cells or untransfected CHO cells were then stained with 
100 nM BODEPY-labeled ezetimibe and analyzed by FACS. Control experiments were also 
performed wherein the cells were not labeled with the BODIPY-ezetimibe and wherein 
untransfected CHO cells were labeled with the BODIPY-ezetimibe. 

[0225] No staining was observed in the untransfected CHO, rNPClLl/CHO or 
mNPClLl/CHO cells. Fluorescence was detected in the positive-control EGFP/CHO cells. 
Staining was also detected in the mouse SR-B1/CHO cells. These data show that, under the 
conditions tested, BODIPY-ezetimibe is capable of binding to SR-B1 and that such binding is 
not ablated by the presence of the fluorescent BODIPY group. When more optimal conditions 
are determined, BODIPY-ezetimibe will be shown to label the rNPClLl/CHO and 
mNPClLl/CHO cells. 

Example 11: FACS Analysis of Transiently Transfected CHO Cells Labeled with Anti- 
FLAG Antibody M2. 

[0226] In these experiments, the expression of FLAG-tagged NPC1L1 on CHO cells was 
evaluated. CHO cells were transiently transfected with mouse NPC1L1 DNA, rat NPC1L1 
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DNA, FLAG- rat NPC1L1 DNA or FLAG- mouse NPC1L1 DNA. The 8 amino acid FLAG tag 
used was DYKDDDDK (SEQ ID NO: 37) which was inserted on the amino-terminal 
extracellular loop just past the secretion signal sequence. The cells were incubated with 
commercially available anti-FLAG monoclonal mouse antibody M2 followed by a BODEPY- 
tagged anti-mouse secondary antibody. The treated cells were then analyzed by FACS. 

[0227] The M2 antibody stained the CHO cells transfected with FLAG-rat NPC1L1 DNA 
and with FL AG-mouse NPC1L1. No staining was observed in the CHO cells transfected with 
mouse NPC1L1 DNA and with rat NPC1L1 DNA. These data showed that rat NPC1L1 and 
mouse NPC1L1 possess no significant, inherent fluorescence and are not bound by the anti- 
FLAG antibody. The observed, FLAG-dependent labeling of the cells indicated that the FLAG- 
mouse NPC1L1 and FLAG-rat NPC1L1 proteins are localized at the cell membrane of the CHO 
cells. 

Example 12: FACS Analysis of FLAG-rat NPC1L1-EGFP Chimera in Transiently 
Transfected CHO Cells. 

[0228] In these experiments, the surface and cytoplasmic localization of rat NPC1L1 in CHO 
cells was evaluated. CHO cells were transiently transfected with FLAG- rat NPC1L1 DNA or 
with FLAG-rat NPC1L1-EGFP DNA. In these fusions, the FLAG tag is at amino-terminus of 
rat NPC1L1 and EGFP fusion is at the carboxy-terminus of rat NPC1L1. The cells were then 
stained with the M2 anti-FLAG mouse (primary) antibody followed by secondary staining with a 
BODEPY-labeled anti-mouse antibody. In control experiments, cells were stained with only the 
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secondary antibody and not with the primary antibody (M2). The stained cells were then 
analyzed by FACS. 

[0229] In a control experiment, FLAG-rat NPC1L1 transfected cells were stained with 
BODIPY anti-mouse secondary antibody but not with the primary antibody. The data 
demonstrated that the secondary, anti-mouse antibody possessed no significant specificity for 
FLAG-rat NPC1L1 and that the FLAG-rat NPC1L1, itself, possesses no significant fluorescence. 

[0230] In another control experiment, unlabeled FLAG-rat NPC 1 LI -EGFP cells were FACS 
analyzed. In these experiments, autofluorescence of the enhanced green fluorescent protein 
(EGFP) was detected. 

[0231] FLAG-rat NPC 1 LI cells were stained with anti-FLAG mouse antibody M2 and with 
the BODIPY-labeled anti-mouse secondary antibody and FACS analyzed. The data from this 
analysis showed that the cells were labeled with the secondary, BODIPY-labeled antibody which 
indicated expression of the FLAG-rat NPC 1 LI protein on the surface of the CHO cells. 

[0232] FLAG-rat NPC 1 LI -EGFP cells were stained with anti-FLAG mouse antibody M2 
and with the BODIPY-labeled anti-mouse secondary antibody and FACS analyzed. The data 
from this analysis showed that both markers (BODIPY and EGFP) were present indicating 
surface expression of the chimeric protein. The data also indicated that a portion of the protein 
was located within the cells and may be associated with transport vesicles. These data supported 
a role for rat NPC 1 LI in vesicular transport of cholesterol or protein expressed in subcellular 
organelles such as the rough endoplasmic reticulum. 
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Example 13: FACS Analysis and Fluorescent Microscopy of FLAG-rat NPC1L1-EGFP 
Chimera in a Cloned CHO Cell Line. 

[0233] In these experiments, the cellular localization of rat NPC1L1 was evaluated by FACS 
analysis and by immunohistochemistry. CHO cells were transfected with FLAG-rat NPC1L1- 
EGFP DNA and stained with anti-FLAG mouse antibody M2 and then with a BODIPY-labeled 
anti-mouse secondary antibody. In the fusion, the FLAG tag is at the amino-terminus of rat 
NPC1L1 and the enhanced green fluorescent protein (EGFP) tag is located at the carboxy- 
terminus of the rat NPC1L1 . The stained cells were then analyzed by FACS and by fluorescence 
microscopy. 

[0234] Cells transfected with FLAG-rat NPC1L1-EGFP DNA were stained with the anti- 
FLAG mouse antibody M2 and then with the BODIPY-labeled anti-mouse secondary antibody. 
FACS analysis of the cells detected both markers indicating surface expression of the chimeric 
protein. 

[0235] FLAG-rat NPC 1 L 1 -EGFP transfected cells were analyzed by fluorescent microscopy 
at 63X magnification. Fluorescent microscopic analysis of the cells indicated non-nuclear 
staining with significant perinuclear organelle staining. Resolution of the image could not 
confirm the presence of vesicular associated protein. These data indicated that the fusion protein 
was expressed on the cell membrane of CHO cells. 

Example 14: Generation of Polyclonal Anti-rat NPC1L1 Rabbit Antibodies. 

[0236] Synthetic peptides (SEQ ID NO: 39-42) containing an amino- or carboxy-terminal 

cysteine residue were coupled to keyhole limpet hemocyanin (KLH) carrier protein through a 
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disulfide linkage and used as antigen to raise polyclonal antiserum in New Zealand white rabbits 
(range 3-9 months in age). The KLH-peptide was emulsified by mixing with an equal volume of 
Freund's Adjuvant, and injected into three subcutaneous dorsal sites. Prior to the 16 week 
immunization schedule a pre-immune sera sample was collected which was followed by a 
primary injection of 0.25 mg KLH-peptide and 3 scheduled booster injections of 0. 1 mg KLH- 
peptide. Animals were bled from the auricular artery and the blood was allowed to clot and the 
serum was then collected by centrifugation 

[0237] The anti-peptide antibody titer was determined with an enzyme linked 
immunosorbent assay (ELISA) with free peptide bound in solid phase (lug/well). Results are 
expressed as the reciprocal of the serum dilution that resulted in an OD450 of 0.2. Detection was 
obtained using the biotinylated anti-rabbit IgG, horse radish peroxidase-streptavidin (HRP-SA) 
conjugate, and ABTS. 

Example 15: FACS Analysis of Rat NPC1L1 Expression in CHO Cells Transiently 
Transfected with Rat NPC1L1 DNA Using Rabbit Anti-rat NPC1L1 Antisera. 
[00103] In these experiments, the expression of rat NPC1L1 on the surface of CHO cells was 
evaluated. CHO cells were transfected with rat NPC1L1 DNA, then incubated with either rabbit 
preimmune serum or with 10 week anti-rat NPC1L1 serum described, above, in Example 14 (i.e., 
A0715, A0716, A0867 or A0868). Cells labeled with primary antisera were then stained with a 
BODIPY-modified anti-rabbit secondary antibody followed by FACS analysis. 

[0238] No antibody surface labeling was observed for any of the pre-immune sera samples. 
Specific cell surface labeling of rat NPC1L1 transfected cells was observed for both A0715 and 
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A0868. Antisera A0716 and A0867 did not recognize rat NPC1L1 surface expression in this 
assay format. This indicates that the native, unfused rat NPC1L1 protein is expressed in the 
CHO cells and localized to the CHO cell membranes. Cell surface expression of NPC1L1 is 
consistent with a role in intestinal cholesterol absorption. 

Example 16: FACS Analysis of CHO Cells Transiently Transfected with FLAG-Mouse 
NPC1L1 DNA or FLAG-rat NPC1L1 DNA or Untransfected CHO Cells Using Rabbit 
Anti-rat NPC1L1 Antisera. 

[0239] In these experiments, the expression of FLAG-mouse NPC1L1 and FLAG-rat 
NPC1L1 in CHO cells was evaluated. CHO cells were transiently transfected with FLAG-mouse 
NPC1L1 DNA or with FLAG-rat NPC1L1 DNA. The FLAG-mouse NPC1L1 and FLAG-rat 
NPC1L1 transfected cells were labeled with either A0801, A0802, A0715 or A0868 sera (see 
Example 14) or with anti-FLAG antibody, M2. The labeled cells were then stained with 
BODIPY-labeled anti-rabbit secondary antibody and FACS analyzed. The untransfected CHO 
cells were analyzed in the same manner as the transfected cell lines. 

[0240] Positive staining of the untransfected CHO cells was not observed for any of the 
antisera tested. Serum A0801 -dependent labeling of FLAG-rat NPC1L1 transfected cells was 
observed but such labeling of FLAG-mouse NPC1L1 transfected cells was not observed. Serum 
A0802-dependent labeling of FLAG-mouse NPC1L1 or FLAG-rat NPC1L1 transfected cells was 
not observed. Strong serum A07 15 -dependent labeling of FLAG-rat NPC1L1 transfected cells 
was observed and weak serum A0715-dependent labeling of FLAG-mouse NPC1L1 transfected 
cells was observed. Weak serum A0868-dependent labeling of rat NPC1L1 and mouse NPC1L1 
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transfected cells was observed. Strong Anti-FLAG M2 antibody-dependent labeling of FLAG- 
rat NPC1L1 and FLAG-mouse NPC1L1 transfected cells was observed. The strong M2 staining 
is likely to be due to the fact that M2 is an affinity-purified, monoclonal antibody of known 
concentration. In contrast, the respective antisera are polyclonal, unpurified and contain an 
uncertain concentration of anti-rat NPC1L1 antibody. These date provide further evidence that 
the FLAG-mouse NPC1L1 and FLAG-rat NPC1L1 proteins are expressed in CHO cells and 
localized to the CHO cell membranes. Cell surface expression of NPC1L1 is consistent with a 
role in intestinal cholesterol absorption. 

Example 17: Immunohistochemical Analysis of Rat Jejunum Tissue with Rabbit Anti-rat 
NPC1L1 Antisera A0715. 

[0241] In these experiments, the localization of rat NPC1L1 in rat jejunum was analyzed by 
immunohistochemistry. Rat jejunum was removed, immediately embedded in O.C.T. compound 
and frozen in liquid nitrogen. Sections (6um) were cut with a cryostat microtome and mounted 
on glass slides. Sections were air dried at room temperature and then fixed in Bouin's fixative. 
Streptavidin-biotin-peroxidase immunostaining was carried out using Histostain-SP kit. 
Endogenous tissue peroxidase activity was blocked with a 10 minute incubation in 3% H2O2 in 
methanol, and nonspecific antibody binding was minimized by a 45 minute incubation in 10% 
nonimmune rabbit serum. Sections were incubated with a rabbit anti-rat NPC1L1 antisera 
A0715 or A0868 at a 1: 500 dilution at 4°C, followed by incubation with biotinylated goat anti- 
rabbit IgG and with streptavidin-peroxidase. Subsequently, the sections were developed in an 
aminoethyl carbazole (AEC)-H202 staining system and counterstained with hematoxylin and 
examined by microscopy. A positive reaction using this protocol is characterized by the 
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deposition of a red reaction product at the site of the antigen-antibody reaction. Nuclei appeared 
blue from the hematoxylin counterstain. Controls were performed simultaneously on the 
neighboring sections from the same tissue block. Control procedures consisted of the following: 
(1) substitute the primary antibody with the pre-immune serum, (2) substitute the primary 
antibody with the non-immune rabbit serum, (3) substitute the primary antibody with PBS, (4) 
substitute the second antibody with PBS. 

[0242] The example shows tissue stained with anti-rat NPC1L1 sera A0715 or with the 
preimmune sera analyzed at low magnification (40X) and at high magnification (200X). The 
A0715-stained tissue, at low magnification, showed positive, strong staining of the villi epithelial 
layer (enterocytes). The A0715-stained tissue at high magnification showed positive, strong 
staining of the enterocyte apical membranes. No staining was observed in tissue treated only 
with preimmune sera. Similar results were obtained with sera A0868. These data indicate that 
rat NPC1L1 is expressed in rat jejunum which is consistent with a role in intestinal cholesterol 
absorption. 

Example 18: Labeled Cholesterol Uptake Assay. 

[0243] In this example, the ability of CHO cells stably transfected with rat NPC1L1 to take 
up labeled cholesterol was evaluated. In these assays, cholesterol uptake, at a single 
concentration, was evaluated in a pulse-chase experiment. The data generated in these 
experiments are set forth, below, in Table 3. 

[0244] Cells: 

A. CHO cells stably transfected with rat NPC1L1 cDNA 
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B. CHO background (no transfection) 
[0245] Cells were seeded at 500,000 cells/ well (mL) in 12-well plates. 

Procedure: 

[0246] All reagents and culture plates were maintained at 37°C unless otherwise noted. 

[0247] Starve. The maintenance media (F12 HAMS, l%Pen/Strep, 10%FCS) was removed 
and the cells were rinsed with serum-free HAMS media. The serum-free media was then 
replaced with 1 mL "starve" media (F12 HAMS, Pen/Strep, 5% lipoprotein deficient serum 
(LPDS). 

[0248] One plate of each cell line was starved overnight. The remaining 2 plates were 
designated "No Starve" (see below). 

[0249] Pre-Incubation. Media was removed from all plates, rinsed with serum-free HAMS 
and replaced with starve media for 30 minutes. 

[0250] 3 H-Cholesterol Pulse. The following was added directly to each well. 

[0251] 0.5uCi 3 H-cholesterol (~1 . 1 X 10 6 dpm/well) in 50ul of a mixed bile salt micelle. 

4.8mM sodium taurocholate (2.581mg/mL) 

0.6 mM sodium oleate (0.183mg/mL) 

0.25 mM cholesterol (0.1 mg/mL) 

Dispersed in "starve" media by ultrasonic vibration 

Final media cholesterol concentration = 5ug/mL 



NY02:47 1952.1 



-81- 



P36134 074669.0105 



[0252] Labeled cholesterol pulse time points were 0, 4, 12 and 24 minutes. Triplicate wells 
for each treatment were prepared. 

[0253] Wash. At the designated times, media was aspirated and the cells were washed 
once with Hobbs Buffer A (50mM Tris, 0.9% NaCl, 0.2% BSA, pH 7.4) and once with Hobbs 
Buffer B (50mM Tris, 0.9% NaCl, pH 7.4 (no BSA)) at 37oC. 

[0254] Processing/Analysis. Cells were digested overnight with 0.2N NaOH, 2mL/well 
at room temperature. One 1.5 mL aliquot was removed from each well, neutralized & counted 
for radioactivity by scintillation counting. Two additional 50ul aliquots from all wells are 
assayed for total protein by the Pierce micro BCA method. The quantity of labeled cholesterol 
observed in the cells was normalized by the quantity of protein in the cells. 
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Table 3. Uptake of 3H-cholesterol by CHO cells transfected with rat NPC1L1 or mouse 
SR-B1 or untransfected CHO cells. 



TitTK* mln 

llllffp) uuu 


Total Cholesterol, dpm protein ± sem 




i Total Cholesterol, di 


pm/mg protein ± sem 


NPC1L1 1 


CHO 




NPC1L1 


CHO 


Afters-Cholesterol 








No Starve 








0 


2067 


±46 


4568 


±1937 






10754 


±166 


22881 ±9230 


4 


2619 


±130 


2868 


±193 






15366 


±938 


15636 ±1471 


12 


2868 


±193 


4459 


±170 






15636 


±1471 


24622 ±966 


24 


7010 


±89 


7204 


±173 






41129 


±685 


J7J01 ±13)1 










S 


ar 


ve 








0 


1937 


±273 


2440 


±299 






10909 


±1847 


12429 ±1673 


4 


3023 


±308 


2759 


±105 






17278 


±1650 


14307 ±781 


12 


2759 


±105 


4857 


±186 






14307 


±781 


26270 ±1473 


24 


6966 


±72 


7344 


±65 






39196 


±174 


38381 ±161 



dpm=disintegrations per minute 
sem=standard error of the mean 



Example 19: Effect of Ezetimibe on Cholesterol Uptake. 

[0255] The effect of ezetimibe on the ability of CHO cells stably transfected with mouse or 
rat NPC1L1 or mouse SR-B1 to take up 3 H-labeled cholesterol was evaluated in pulse-chase 
experiments. One cDNA clone of mouse NPC1L1 (C7) and three clones of rat NPC1L1 (C7, 
CI 7 and C21) were evaluated. The ability of CHO cells stably transfected with mouse SR-B1, 
mouse NPC1L1 and rat NPC1L1 to take up labeled cholesterol, in the absence of ezetimibe, was 



NY02:471 952.1 



-83- 



P36134 074669.0105 



also evaluated in the pulse-chase experiments. Data generated in these experiments are set forth, 
below, in Tables 4 and 5. Additionally, the quantity of total cholesterol taken up by transfected 
and untransfected CHO cells in the presence of four different unlabeled cholesterol 
concentrations was also evaluated. The data from these experiments is set forth, below, in Table 
6. 

[0256] Cells: 

A. CHO cells stably transfected with rat or mouse NPC1L1 cDNA 

B. CHO background (no transfection) 

C. SR-B1 transfected CHO cells 

Cells seeded at 500,000 cells / well (mL) in 12-well plates. 

Procedure: 

[0257] All reagents and culture plates were maintained at 37°C unless otherwise noted. 

[0258] Starve. The maintenance media (F12 HAMS, l%Pen/Strep, 10%FCS) was removed 
and the cells were rinsed with serum-free HAMS media. The serum-free media was then 
replaced with 1 mL "starve" media (F12 HAMS, Pen/Strep, 5% lipoprotein deficient serum 
(LPDS). The cells were then starved overnight. 

[0259] Pre-Incubation/ pre-dose. Media was removed from all plates and replaced with 
fresh starve media and preincubated for 30 minutes. Half of the wells received media containing 
ezetimibe (stock soln in EtOH; final cone. = lOuM). 

[0260] 3 H-Cholesterol Pulse. The following was added directly to each well: 
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0.5uCi 3 H-cholesterol (-1.1 X 106 dpm/well) in 50|ul of a mixed bile salt micelle 

4.8mM sodium taurocholate (2.581mg/mL) 

0.6 mM sodium oleate (0.183mg/mL) 

0.25 mM cholesterol (0.1 mg/mL) 

Dispersed in "starve" media by ultrasonic vibration 

Final media cholesterol concentration = 5ug/mL 
[0261] Labeled cholesterol pulse time points were 4, 12, 24 minutes and 4 hours. Triplicate 
wells were prepared for each treatment. 

[0262] Wash. At designated times, media was aspirated and cells were washed once 
with Hobbs Buffer A (50mM Tris, 0.9% NaCl, 0.2% bovine serum albumin (BSA), pH 7.4) and 
once with Hobbs Buffer B (50mM Tris, 0.9% NaCl, pH 7.4 (no BSA)) at 37oC. 

Processing/Analysis. 

[0263] A. 4, 12, 24 minute time points : Cells were digested overnight with 0.2N 

NaOH, 2mL/well, room temperature. One 1.5 mL aliquot was removed from each well, 
neutralized & counted for radioactivity by scintillation counting. 

[0264] B. 4 hour time point : The digested cells were analyzed by thin-layer 

chromatography to determine the content of cholesterol ester in the cells. 

[0265] Extracts were spotted onto TLC plates and run for 30 minutes in 2 ml hexane: 
isopropanol (3: 2) mobile phase for 30 minutes, followed by a second run in 1ml hexane: 
isopropanol (3: 2) mobile phase for 15 minutes. 
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[0266] C. Protein determination of cell extracts . Plates containing a sample of the 

cell extracts were placed on orbital shaker at 120 rpm for indicated times and then extracts are 
pooled into 12 X 75 tubes. Plates were dried and NaOH (2ml/well) added. The protein content 
of the samples were then determined. Two additional 50ul aliquots from all wells were assayed 
for total protein by the Pierce micro BCA method. The quantity of labeled cholesterol observed 
in the cells was normalized to the quantity of protein in the cells. 
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Table 4. Total Cholesterol in Transfected CHO Cells in the Presence and Absence of 
Ezetimibe. 





Total Cholesterol, dpm± sem 




Total Cholesterol, dpnVms protein ± sem 




Vehicle 


EZHOuM) 




Vehicle 


EZdOuM) 


Pones: 








4MlnPulse 








CHO Control 


3413 


J417 


3222 


±26 




33443 


±4070 


31881 


±483 


SR-BI 


14207 


±51 


10968 


±821 




118242 


±1261 


92474 


±2902 


mNPClLl(C7) 


4043 


±419 


4569 


±222 




30169 


±3242 


30916 


±1137 


rNFClLl(C21) 


3283 


±288 


3769 


±147 




23728 


±2111 


27098 


±689 


rNPClLl(C17) 


3188 


±232 


3676 


±134 




24000 


±832 


28675 


±527 


rNPaLl(C7) 


1825 


±806 


3268 


±121 




15069 


±6794 


27285 


±968 










12 M 


n 


Pulse 








CHO Control 


4710 


±246 


4532 


±165 




44208 


±2702 


43391 


±1197 


CD DI 


16970 


±763 


12349 


±298 




140105 


±6523 


98956 


±4447 


mSPClLl(C7) 


6316 


±85 


6120 


±755 




45133 


±342 


41712 


±4054 


rNPClLl(C21) 


5340 


±12 


4703 


±231 




40018 


±1181 


33985 


±1928 


rNPClLl(C17) 


4831 


±431 


4579 


±257 




37378 


±3461 


34063 


±1619 


rNPaLl(C7) 


4726 


±272 


4664 


±63 




39100 


±2350 


38581 


±784 














24Mln 


Pulse 








CHO Control 


7367 


±232 


6678 


±215 




65843 


±1281 


61764 


±2131 


SR-BI 


39166 


±2152 


23558 


±1310 




324126 


±11848 


198725 


±11713 


n*IPClLl(C7) 


10616 


±121 


9749 


±482 




77222 


±1040 


74041 


±3670 


rNPClLl(C21) 


9940 


±587 


8760 


±293 




76356 


±9618 


66165 


±2181 


rNPClLl(C17) 


8728 


±721 


8192 


±237 




70509 


±5189 


62279 


±4352 


rNPaLl(C7) 


8537 


±148 


7829 


±204 




72134 


±1305 


63482 


±368 



EZ = ezetimibe 
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Table 5. Cholesterol Ester in CHO cells in the Presence or Absence of Ezetimibe. 



Table 5. Cholesterol Ester in CHO cells in the Presence or Absence of Ezetimibe. 





Cholesterol Ester,i 


lpm±sem 






Cbolesteryl Ester, d 


pirVmg protein ± sem 




Vehicle 


EZaOuM) 




Vehicle 


EZ(lOuM) 


Clones: 








4 Hour Pulse 






l.hu control 


652 


±13 


208 


±9 




5647 


±55 


1902 ±87 


SR-BI 


47608 


±1292 


9305 


±401 




391067 


±14391 


72782 ±3181 


mNPClLl(C7) 


732 


±127 


453 


±118 




4994 


±827 


3057 ±776 


rNPClLl(C21) 


2667 


±90 


454 


±33 




18655 


±1032 


3193 ±265 


rNPOLl(a7) 


751 


±74 


202 


±10 




5379 


±481 


1510 ±62 


rNPClLtfCT) 


462 


±25 


191 


±54 




3597 


±193 


1496 ±403 




Free Cholesterol, dpm ± sem 




Free Cholesterol, d 


MiVmg protein ± sem 




Vehicle ! 


EZ(lOuM) 




Vehicle 


Uj 1 JUUIVII 










4 Hour 


*ulse 






CHO Control 


61612 


±1227 


56792 


±568 




533876 


±17770 


519607 ±16203 


SR-BI 


214678 


±4241 


194519 


±474 




1762873 


±46607 


1521341 ±4185 


mNPClLl(C7) 


79628 


±793 


77516 


±1910 




544661 


±1269 


523803 ±10386 


rNPClLl(C21) 


71352 


±1343 


69106 


±711 




498016 


±8171 


485460 ±4410 


rNPClLl(C17) 


78956 


±3782 


71646 


±446 




566456 


±29204 


536651 ±7146 


rISPClLl(C7) 


75348 


±2093 


70628 


±212 




586127 


±13932 


556855 ±7481 



EZ =ezetimibe 
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Table 6. Uptake of labeled cholesterol in the presence of increasing amounts of unlabeled 
cholesterol. 





yygioiiiiSi 


Total Cholesterol, dpm ± sem 


, 


I Total Cholesterol, dpm/mg protein ± tern 










1 CHO Control 1 


SH-B1 


| lf mNPCll,l(C71 


rf<PCl,I,l(C21) 


Cold Cbolaitf rol 






1 


M MId Pulie 








3ug/nL 


12271 ±430 


49603 ±2428 


14230 ±1628 


I0656 H233 




108936 ±5413 


541562 ±13783 


140764 ±14433 


94945 ±12916 


1 A nn/ml 

iv jig/mL 


16282 ±2438 


79967 ±8131 


25465 ±3037 


13225 ±4556 




151283 ±23343 


880224 ±82254 


250985 ±27481 


123433 ±34092 


30 (ig/mL 


U758 ±1607 


7I92J ±3863 


19001 ±1330 


13218 ±1149 




135109 ±12106 


796236 ±18952 


180436 ±12112 


II 1522 ±6941 


100 ug/mL 


16438 ±1614 


38183 ±4348 


15973 ±1663 


11560 ±1132 




149559 ±17977 


630143 ±3718 


147717 ±8261 


101328 ±7191 






Cholesteryl Ester, dpm * sem 






Cholesteryl Ester. dom/mir protein ± sem 






mm 


1 unci m is r«ui till 14 1 1 fWI 11 




CHO Control 1 


SR-BI 


| mNPCll.lfC7) 


rf!PClLl{C21) 










4 Hour Pulie 








3 ug/mL 


2737 ±114 


39396 ±1241 


1561 ±1 


4015 ±47 




22050 ±978 


382641 ±3955 


13684 ±217 


32020 ±64) 


1A nn Iwwh 1 

■v pig/niLi 


1646 ±76 


17292 ±362 


998 ±36 


1866 ±33 




13323 ±606 


157914 ±3400 


8917 ±467 


14849 ±127 


30 ug/mL 


970 ±46 


6642 ±133 


337 ±82 


970 ±9 




7627 ±325 


63547 ±1760 


4885 ±748 


7741 ±100 


100|ig/mL 


895 ±156 


4777 ±27 


405 ±7 


777 ±16 




7135 ±1230 


45088 ±1326 


3663 ±68 


6005 ±198 




gBRiqli 


Free CholesteroL dpm ± sen) 






Free Cholesterol, dpm/rai protein ± se 


HI 








Itrnci MjjK4iilVll4IM4Ul 




CHO Control 1 


SR-BI 


| m|SPClLlfC71 


rNPClLl(C21) 










4 Hour Pulse 








3 Ug/mL 


89013 ±3724 


211783 ±3268 


104343 ±2112 


92244 ±987 




717308 *34130 


2047695 ±16213 


914107 ±5869 


735498 ±11209 


Ifl no/ml 
IV pg/mb 


136396 ±8366 


278216 ±10901 


196173 ±4721 


125144 ±877 




1105118 ±76074 


2540130 ±92471 


1753072 ±86578 


996824 ±27850 


30 iig/mL 


131745 ±2922 


224429 ±2336 


149172 ±19689 


117143 ±4976 




1036195 ±21142 


2149315 ± 78068 


1357136 ±180264 


934772 ±43202 


lOOpg/ml 


79336 ±4011 


231470 ±4221 


114599 ±2803 


93538 ±1588 




632965 ±29756 


2182022 ±36793 


1035979 ±30329 


723225 ±21694 






Cholesteryl Ester, dpm * »em 






Cholesteryl Esler. dpm/mg protein £ sem 












CHO Control 1 














24 Hour Pulie 






■■■■■ 


3 pg/mL 


J7373 ±2704 


162296 ±1644 


22986 ±940 


59377 ±953 




357629 ±14639 


1248900 ±18363 


160328 ±6363 


401315 ±3557 


10ng/roL 


33730 ±1296 


112813 ±373 


14836 ±532 


31797 ±523 




215004 ±5942 


B30231 ±12764 


98594 ±4203 


200451 ±5239 


30 pg/mL 


19193 ±100 


58668 ±1413 


8878 ±355 


18963 ±380 




122071 ±1271 


446581 ±3472 


59091 ±2697 


119728 ±2131 


100 pg/mL 


16761 ±398 


31280 ±1270 


8784 ±946 


14933 ±311 




103235 ±1739 


272796 ±13392 


60670 ±4397 


96215 ±1023 






Free CholesteroL dpm ± tern 






Free Cholesterol, dnm/ms protein ± tm 




CHO Control 1 


SR-BI 


lmNPClLHC7)l 


rNPClLlfC21> 




CHO Control 1 


SR-BI 


1 mNPClLWCTl 


rNPClLHC2n 










24 Hour Pulse 








3 Ug/mL 


248983 ±4207 


357819 ±4319 


285610 ±3187 


227244 ±1016 




1552637 ±18954 


2752957 ±24984 


1993256 ±36968 


1536023 ±10304 


lOiig/mL 


231208 ±8927 


269822 ± 3872 


3 11777 ±8227 


231666 ±6198 




1477414 ±83954 


1984473 ±18420 


2069980 ±25517 


1461 157 ±58517 


30 (if/mL 


203366 ±6008 


225273 ±3932 


279604 ±6612 


209372 ±3386 




1294878 ±41819 


1716066 ±52381 


1859476 ±29307 


1321730 ±5452 


100|ig/mL 


178424 ±2379 


167082 ±2211 


229832 ±4199 


182678 ±7709 




1099648 ±23160 


1455799 ±9885 


1599244 ±76938 


1177546 ±51 191 
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Example 20: Labeled Cholesterol Uptake Assay. 

[0267] In this example, the ability of CHO cells transiently transfected with rat NPC1L1 or 
mouse SR-B1 to take up labeled cholesterol was evaluated. Also evaluated was the ability of rat 
NPC1L1 to potentiate the ability of CHO cells transfected with mouse SR-B1 to take up labeled 
cholesterol. In these assays, cholesterol uptake, at a single concentration, was evaluated in pulse- 
chase experiments. The data generated in these experiments are set forth, below, in Table 7. 

[0268] Cells: 

A. CHO background cells (mock transfection). 

B. CHO cells transiently transfected with mouse SR-B1 . 

C. CHO transiently transfected with rat NPC1L1 cDNAs (n=8 clones). 
[0269] Transiently transfected cells were seeded at 300,000 cells / well (mL) in 12-well plates. 

Procedure: 

[0270] All reagents and culture plates were maintained at 37°C unless otherwise noted. 

[0271] Starve. The maintenance media (F 12 HAMS, l%Pen/Strep, 10%FCS) was removed 
from the cells and replaced with 1 mL "starve" media (F12 HAMS, Pen/Strep, 5% lipoprotein 
deficient serum (LPDS). Cells were starved for 1 hour. 

[0272] 3 H-CholesteroI Pulse. The following was added directly to each well. 

[0273] 0.5uCi 3 H-cholesterol (-1.1 X 106 dpm/well) in 50ul of a mixed bile salt micelle. 

4.8mM sodium taurocholate (2.581mg/mL) 
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0.6 mM sodium oleate (0.183mg/mL) 

0.25 mM cholesterol (0.1 mg/mL) 

Dispersed in "starve" media by ultrasonic vibration 

Final media cholesterol concentration = 5ug/mL 
[0274] Labeled cholesterol pulse time points were 24 Min and 4 hours. Triplicate wells for each 
treatment. 

[0275] Wash. At the designated times, media was aspirated and cells were washed once with 
Hobbs Buffer A (50mM Tris, 0.9% NaCl, 0.2% BSA, pH 7.4) and once with Hobbs Buffer B 
(50mM Tris, 0.9% NaCl, pH 7.4 (no BSA)) at 37oC. 

Processing/ Analysis. 

[0276] A. 24 minute time point : Cells were digested overnight with 0.2N NaOH, 

2mL/well at room temp. One 1.5 mL aliquot was removed from each well, neutralized & 
counted for radioactivity by scintillation counting. 

[0277] B. 4 hour time point : The digested cells were analyzed by thin-layer 

chromatography to determine the content of cholesterol ester in the cells. 

[0278] The extracts were spotted onto thin layer chromatography plates and run in 2 ml hexane: 
isopropanol (3: 2) containing mobile phase for 30 minutes, followed by a second run in 1ml 
hexane: isopropanol (3: 2) containing mobile phase for 15min. 

[0279] C. Protein determination of cell extracts : Plates containing a sample of the 

cell extracts were placed on orbital shaker at 120 rpm for indicated times and then extracts are 
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pooled into 12X75 tubes. Plates were dried and NaOH (2ml/well) added. The protein content of 
the samples were then determined. Two additional 50ul aliquots from all wells were assayed for 
total protein by the Pierce micro BCA method. The quantity of labeled cholesterol observed in 
the cells was normalized to the quantity of protein in the cells. 
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Table 7. Labeled cholesterol uptake in transiently transfected CHO cells. 





Total Cholesterol, ± sem 




dpm 




dpm/mg protein 


Transfection 


24 Min Pulse 


pilrt Cimtrnl fmnt>lz\ 


4721 ±436 




49024 ±4328 


otv-i»n i ransienij 


5842 ±82 




59445 ±1099 1 




4092 ±377 




47026 ±2658 


SR-BI/NPC1L1 (trans) 


3833 ±158 




52132 ±3071 




Cholesteryl E 


■ster, ± sem 




dpm 




dpm/mg protein 




4 Hour Pulse 


CHO Control frmwkl 


2132 ±40 




20497 ±640 


SR-BinVn n«l«« nf\ 


5918 ±237 




51812 ±1417 


NPCIf 1 rrrn n«ste nt\ 

i "IV/IL/I ^ I r 3UIAICI11J 


1944 ±93 




19788 ±642 ! 


SR-BI/NPC1L1 (trans) 


4747 ±39 




58603 ±1156 




Free Cholesterol, ± sem 




dpm 




dpm/mg protein 




4 Hour Pulse 


CHO Control (mock) 


45729 ±328 




439346 ±5389 


SR-BI(Transient) 


50820 ±2369 




444551 ±9785 


NPC1L1 (Transient) 


39913 ±1211 




406615 ±6820 


SR-BI/NPC1L1 (trans) 


37269 ±1225 




459509 ±6195 



Example 21: Expression of rat, mouse and human NPC1L1. 

[0280] In this example, NPC1L1 was introduced into cells and expressed. Species specific 
NPC1L1 expression constructs were cloned into the plasmid pCDNA3 using clone specific PCR 
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primers to generate the ORF flanked by appropriate restriction sites compatible with the 
polylinker of the vector. For all three species of NPC1L1, small intestine total tissue RNA was 
used as a template for reverse transcriptase-polymerase chain reaction (RT-PCR) using oligo dT 
as the template primer. The rat NPC1L1 was cloned as an EcoRI fragment, human NPC1L1 was 
cloned as a Xbal/NotI fragment and mouse NPC1L1 was cloned as an EcoRI fragment. Forward 
and reverse strand sequencing of each clone was performed to confirm sequence integrity. 
Standard transient transfection procedures were used with CHO cells. In a 6-well plate CHO 
cells were plated 1 day before transfection at a plating density of 2 X 10 5 cells/well. The 
following day, cells were incubated with 2 ug plasmid DNA and 6 uL Lipofectamine for 5 hours 
followed a fresh media change. Forty-eight hours later, cells were analyzed for NPC1L1 
expression using anti-NPClLl antisera by either FACS or western blot. To establish stable long 
term cell lines expressing NPC1L1, transfected CHO cells were selected in the presence of 
geneticin (G418, 0.8 mg/ml) as recommended by the manufacturer (Life Technologies). 
Following one month of selection in culture, the cell population was stained with anti-NPClLl 
antisera and sorted by FACS. Individual positive staining cells were cloned after isolation by 
limiting dilution and then maintained in selective media containing geneticin (0.5 mg/ ml). 

[0281] Other cell types less susceptible to transfection procedures have been generated using 
adenoviral vector systems. This system used to express NPC1L1 is dervied from Ad 5, a type C 
adenovirus. This recombinant replication-defective adenoviral vector is made defective through 
modifications of the El, E2 and E4 regions. The vector also has additional modifications to the 
E3 region generally affecting the E3b region genes RIDa and RIDb. NPC1L1 expression was 
driven using the CMV promoter as an expression cassette substituted in the E3 region of the 
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adenovirus. Rat and mouse NPC1L1 were amplified using clone specific primers flanked by 
restriction sites compatible with the adenovirus vector Adenovirus infective particles were 
produced from 293-D22 cells in titers of 5 X 10 10 P/mL. Viral lysates were used to infect cells 
resistant to standard transfection methodologies. In Caco2 cells, which are highly resistant to 
heterologous protein expression, adenovirus mediated expression of NPC1L1 has been shown by 
western blot analysis to persist at least 21 days post-infection. 

Example 22: NPC1L1 Knock-Out Transgenic Mouse. 

[0282] NPC1L1 knockout mice were constructed via targeted mutagenesis. This methodology 
utilized a targeting construct designed to delete a specific region of the mouse NPC1L1 gene. 
During the targeting process the E. coli lacZ reporter gene was inserted under the control of the 
endogenous NPC1L1 promoter. The region in NPC1L1 (SEQ ID NO: 45) being deleted is from 
nucleotide 790 to nucleotide 998. The targeting vector contains the LacZ-Neo cassette flanked 
by 1.9 kb 5' arm ending with nucleotide 789 and a 3.2 kb 3' arm starting with nucleotide 999. 
Genomic DNA from the recombinant embryonic stem cell line was assayed for homologous 
recombination using PCR. Amplified DNA fragments were visualized by agarose gel 
electrophoresis. The test PCRs employed a gene specific primer, which lies outside of and 
adjacent to the targeting vector arm, paired with one of three primers specific to the LacZ-Neo 
cassette sequence. For 5' PCR reconfirmation, the NPC1L1 specific oligonucleotide 
ATGTTAGGTGAGTCTGAACCTACCC (SEQ ID NO: 46) and for 3 'PCR reconfirmation the 
NPC1L1 specific oligonucleotide GGATTGCATTTCCTTCAA GAAAGCC (SEQ ID NO: 47) 
were used. Genotyping of the F2 mice was performed by multiplex PCR using the NPC1L1 
specific forward primer TATGGCTCTGCCC TCTGCAATGCTC (SEQ ID NO: 48) the LacZ- 
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Neo cassette specific forward primer TCAGCAGCCTCTGTTCCACATACACTTC (SEQ ED 
NO: 49) in combination with the NPC1L1 gene specific reverse primer 
GTTCCACAGGGTCTGTGGTGAGTTC (SEQ ID NO: 50) allowed for determination of both 
the targeted and endogenous alleles. Analysis of the PCR products by agarose gel 
electrophoresis distinguished the wild-type, heterozygote and homozygote null mouse from each 



Example 23: Acute Cholesterol Absorption in NPCILI-Deficient Mice. 

[0283] To determine whether NPC1L1 plays a role in cholesterol absorption, NPC1L1 deficient 

mice were studied. 

[0284] Mice deficient in NPC1L1 (-/-) were generated by breeding heterozygote mice (+/) to 
obtain wild-type (+/+) and NPC1L1 deficient mice (-/-). Non-fasted mice (6.5-9 weeks old, 
mixed 129 and C57BL/6 background) were weighed and grouped (n=2 -/- and n=4 +/+). All 
animals were gavaged (Feeding needles, 24G x 1 inch, Popper and Sons, NY) with 0.1 ml corn 
oil (Sigma; St. Louis, MO) containing luCi 14 C-cholesterol (New England Nuclear, [ 4 " 14 C] 
Cholesterol, NEC-018) and 0.1 mg carrier cholesterol mass (Sigma; St. Louis, MO). Two hours 
later, blood was collected by heart puncture. The liver was removed, weighed, and three samples 
were placed into 20 ml counting vials. Tissues were digested in 1 ml of IN NaOH at 60°C 
overnight. The tissue digests were acidified by addition of 250ul of 4N HC1 prior to liquid 
scintillation counting (LSC). Plasma was isolated by centrifugation at 10,000 rpm for 5 minutes 
in a micro fuge and duplicate lOOul aliquots of plasma were taken for LSC. 



other. 
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[0285] Cholesterol absorption, evaluated by this acute technique and expressed as the total 
amount of radioactive cholesterol in the plasma and liver, demonstrated that the wild type mice 
(+/+) absorbed an average of 1 1,773 dpm and NPC1 LI deficient mice absorbed 992 dpm of the 
14C-cholesterol. These results indicate that the NPC1L1 deficient mice have a 92% reduction in 
cholesterol absorption. These data confirm the role of NPC1L1 in intestinal cholesterol 
absorption. Inhibition of NPC1 LI -mediated cholesterol absorption, in a subject, by 
administering NPC1L1 antagonists, such as ezetimibe, to the subject, are a useful way to reduce 
serum cholesterol levels and the occurrence of atherosclerosis in the subject. 

Example 24: Cholesterol Absorption in NPC1L1 (NPC3) Knockout Mice (Fecal Ratio 
Method: Cholesterol/Sitostanol). 

[0286] In this example, cholesterol absorption and the activity of ezetimibe was determined in 
the NPC1L1 knockout mice (-/-), heterozygous mice (+/-), and age matched wild-type mice 
(+/+). 

[0287] Cholesterol absorption in the mice was determined by the dual fecal isotope ratio method 
as described by Altmann et al. (Biochim. Biophys. Acta. 1580(1): 77-93 (2002)). Mice (n= 4- 
6/group) were fed a standard rodent chow diet and in some groups treated daily with a maximally 
effective dose of ezetimibe (10 mg/kg). Mice were gavaged with 14 C-cholesterol (luCi, O.lmg 
unlabeled cholesterol) and 3 H-sitostanol (2uCi) in 0.1ml corn oil. Feces were collected for 2 
days and fecal 14 C-cholesterol and 3 H-sitostanol levels were determined by combustion in a 
Packard Oxidizer. The fraction of cholesterol absorbed, as evaluated by the fecal dual isotope 
technique, was similar in wild type (+/+) and heterozygous mice (+/-) fed a chow diet 
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(heterozygous mice absorbed 46 ±5% and age matched wild type mice absorbed 51 ±3% of the 
dose of 14 C-cholesterol). The NPC1L1 knockout mice (-/-) absorbed 15.6 ±0.4% of the 14 C- 
cholesterol, which was similar to the wild type mice treated with a maximally effective dose of 
ezetimibe (16.1 ±0.3%), and reduced by 69% compared to wild type mice (pO.OOl). In NPC1L1 
knockout treated with ezetimibe at 10 mg/kg/day, cholesterol absorption was similar to that seen 
in the untreated knockout mice (16.2 ±0.6% compared to 15.6% ±0.4%, respectively). Thus, the 
majority of cholesterol absorption is dependent on the presence of NPC1L1 and the residual 
cholesterol absorption in mice lacking NPC1L1 is insensitive to ezetimibe treatment. These 
results indicate that NPC1L1 is involved in the small intestinal enterocyte uptake and absorption 
of cholesterol and is in the ezetimibe sensitive pathway. 

Example 25: Mouse Screening Assay (Acute Cholesterol Absorption). 

[0288] The following screening assay is used to identify the presence of an NPC1L1 antagonist 

in a sample. 

[0289] Mice deficient in NPC1L1 (-/-) are generated by breeding heterozygote mice (+/) to 
obtain wild-type (+/+) and NPC1L1 deficient mice (-/-). 

[0290] In a first set of experiments, non-fasted mice (6.5-9 weeks old, mixed 129 and C57BL/6 
background) are weighed and grouped (n=l to 4 -/- and n=l to 4 +/+). All animals are gavaged 
(Feeding needles, 24G x 1 inch, Popper and Sons, NY) with 0.1 ml corn oil (Sigma; St. Louis, 
MO) containing luCi 14 C-cholesterol (New England Nuclear, [ 414 C] Cholesterol, NEC-018) and 
O.lmg carrier cholesterol mass (Sigma; St. Louis, MO). 
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[0291] In another set of experiments, 1 to 4 wild-type NPC1L1 mice (+/+) are treated identically 
to the mice in the first set of experiments, above, except that the mice are additionally fed a 
sample to be tested for the presence of an NPC1L1 antagonist. 

[0292] Two hours later, blood is collected from each mouse by heart puncture. The liver is 
removed, weighed, and three samples are placed into 20 ml counting vials. Tissues are digested 
in 1 ml of IN NaOH at 60°C overnight. The tissue digests are acidified by addition of 250ul of 
4N HC1 prior to liquid scintillation counting (LSC). Plasma is isolated by centrifugation at 
10,000 rpm for 5 minutes in a micro fuge and duplicate lOOul aliquots of plasma are taken for 
LSC. 

[0293] Cholesterol absorption, evaluated by this acute technique is expressed as the total amount 
of radioactive cholesterol in the plasma and liver. The sample tested is determined to contain an 
NPC1L1 antagonist when the level of cholesterol absorption (as measured by the above 
described methods) in the wild-type NPC1L1 mouse (+/+) which was fed the sample and in the 
NPC1L1 deficient mouse (-/-) are less than the amount of cholesterol absorption in the wild-type 
NPC1L1 mouse (+/+) which was not fed the sample. 

Example 26: Mouse Screening Assay (Fecal Ratio Method: Cholesterol/Sitostanol). 

[0294] The following screening assay is used to identify the presence of an NPC1L1 antagonist 
in a sample. 

[0295] Cholesterol absorption in the mice is determined by the dual fecal isotope ratio method as 
described by Altmann et al. (Biochim. Biophys. Acta. 1580(1): 77-93 (2002)). 
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[0296] Three groups of mice (n=l-6/group) are assembled. Two separate groups comprise wild- 
type NPC1L1 mice (+/+) and one group comprises NPC1L1 deficient mice (-/-). 

[0297] Each group is fed a standard rodent chow diet and in some groups treated daily. Mice are 
gavaged with 14 C-cholesterol (luCi, O.lmg unlabeled cholesterol) and 3 H-sitostanol (2uCi) in 
0.1ml corn oil. One group of mice, which comprise wild-type NPC1L1 mice (+/+) are further 
fed a sample to be tested for the presence of an NPC1L1 antagonist. Feces are collected for 2 
days and fecal 14 C-cholesterol and 3 H-sitostanol levels are determined by combustion in a 
Packard Oxidizer. 

[0298] The sample tested is determined to contain an NPC1L1 antagonist when the level of 
cholesterol and/or sitostanol absorption (as measured by the above described methods) in the 
wild-type NPC1L1 mouse (+/+) which was fed the sample and in the NPC1L1 deficient mouse (- 
/-) are less than the amount of cholesterol and/or sitostanol absorption in the wild-type NPC1L1 
mouse (+/+) which was not fed the sample. 

Example 27: Binding Analysis Using Brush Border Membrane Vesicles 

[0299] The following screening assay may be used to identify the presence of an NPC1L1 ligand 

in a sample. 

[0300] Materials. The following two compounds were synthesized for the binding assay 
described herein , 3 H-ezetimibe glucuronide 1 (34.5 Ci/mmol) and its 35 S-propargyl-sulfonamide 
derivative 2 (800-1 100 Ci/mmol). 
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[0301] Synthesis of ezetimibe glucuronide and S-propargyl-suIfonamide ezetimibe- 
glucuronide. Ezetimibe glucuronide (compound 1) (also referred to as EZE-glucuronide) can be 
made according to the procedures in U.S. Patent No. 5,756,470. The general scheme below 
illustrates a method for the synthesis of compound 2 and radiolabeled 35 S-2. 
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[0302] Preparation of compound S-2 (Compound 2 with radiolabeled S) 

[0303] Step A: Preparation of [MS]N-prop-2-yn-l-ylmethanesulfonamide (i). The 

appropriate volume of [35s]methane sulfonyl chloride (see Dean, D.C.; et al., J. Med. Chem. 
1996, 39, 1767) totaling 3.5 mCi was removed from a stock solution in methylene chloride and 
placed in a 5mL conical flask. It was then distilled at atmospheric pressure until the volume was 
approximately 50 uL. To this solution was immediately added 50 uL of propargylamine. After 
15 min, the reaction mixture was diluted with 10 mL of ethyl acetate, washed with saturated 
sodium bicarbonate solution (3x2 mL), and dried over sodium sulfate. After filtration the 
resulting solution had a count of 3.3 mCi and a radiochemical purity of 99.9 % by HPLC 
(Zorbax XDB C8 column, 4.6 x 150 mm, 5 % acetonitrile:H 2 0 (0.1 % TFA) to 100 % 
acetonitrile, 15 min linear gradient, 1 mL/min, t R = 4.4 min). 

[0304] Step B: Preparation of [35s] 4-[(2S,3/?)-3-[(35)-3-(4-fluorophenyl)-3- 

hydroxypropyl]- 1 -(4- {3-[(methylsulfonyl)amino]prop- 1 -yn- 1 -yl}phenyl)-4-oxoazetidin-2- 
yl]phenyl methyl P-D-glucopyranosiduronate ([35S]) (iii). Dissolved 3.0 mCi of [35s]N-prop- 
2-yn-l-ylmethanesulfonamide, 1 mg of compound ii (prepared according to Burnett, D.S. et al., 
Bioorg. Med. Chem. Lett. (2002), vol. 12, p. 31 1), and 1 uL of triethylamine in 100 uL of 
dimethylformamide inside a plastic microcentrifuge tube. To this solution was added 10 uL of a 
stock solution containing 8.1 mg of tetrakis(triphenylphosphine)palladium(0) and 1.4mg of 
copper iodide in 1 mL of dimethylformamide. Stirred at room temperature for sixty hours at 
which time HPLC indicated 55% conversion. This reaction mixture, which had a radiochemical 
purity of 44.4% by HPLC (Zorbax XDB C8 column, 4.6 x 150 mm, 5 % acetonitrile.H 2 0 (0.1 % 
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TFA) to 100 % acetonitrile, 15 min linear gradient, 1 mL/min, £ R = 9.3 min) was taken on 
directly to the next step. 

[0305] Step C:Preparationof [35S] 4-[(2S,3R)-3-[(3S)-3-(4-fluorophenyl)-3-hydroxypropyl]-l- 
(4- {3-[(methylsulfonyl)amino]prop- 1 -yn- 1 -yl} phenyl)-4-oxoazetidin-2-yl]phenyl p-D- 
glucopyranosiduronic acid 35 S-2. The crude reaction mixture containing compound iii was 
treated with 25 fiL of methanol, 90 uL of water, and 30 u.L of triethylamine and stirred at room 
temperature for one hour at which time it was concentrated to near dryness under a slow stream 
of nitrogen. The residue was dissolved in 1:1 acetonitrile:H20 and subjected to semi-preparative 
chromatography (Zorbax XDB C8 250 x 9.4 mm column, 70:30 acetonitrile:H 2 0 (0.1 % TFA) 4 
mL/min, 1 x 0.2 mL injections). 540 u€i of product was obtained which had a radiochemical 
purity of 99.9% by HPLC (Zorbax XDB C8 column, 4.6 x 150 mm, 70:30 acetonitrile:H 2 0 (0.1 
% TFA), 1 mL/min, t& = 10.4 min) and coeluted with an authentic sample of compound 2. 
LC/MS m/z = 508 (product - glucuronide - H 2 0), SA = 769 Ci/mmol. 

Alternate Preparation of 35 S-2. 

[0306] Step A: Preparation of iii. The appropriate volume of [ 35 S]methane sulfonyl chloride 
(see Dean, D.C.; et al., /. Med. Chem. 1996, 39, 1767) totaling 1.3 mCi was removed from a 
stock solution in methylene chloride and placed in a 5mL conical flask. It was then distilled at 
atmospheric pressure until the volume was approximately 50 uL. To this solution was 
immediately added a solution of 1 mg of v in 5 uL of pyridine (freshly distilled over calcium 
hydride). 
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P 




'OH 



CSC-CH 2 -NH 2 



[0307] The solution was stirred at room temperature for five minutes at which time it was 
concentrated to near dryness under a slow stream of nitrogen. This reaction mixture, which had a 
radiochemical purity of 80.1% by HPLC (Zorbax XDB C8 column, 4.6 x 150 mm, 5% 
acetonitrile:H 2 0 (0.1 % TFA) to 100 % acetonitrile, 15 min linear gradient, 1 mL/min, / R = 9.3 
min) was taken on directly to the next step. 

[0308] Step B Preparation of 35 S-2. The crude reaction mixture containing iii was treated with 
25 uL of methanol, 90 p.L of water, and 30 uL of triethylamine and stirred at room temperature 
for one hour at which time it was concentrated to near dryness under a slow stream of nitrogen. 
The residue was dissolved in 1:1 acetonitrile:H 2 0 and subjected to semi-preparative 
chromatography (Zorbax XDB C8 250 x 9.4 mm column, 70:30 

acetonitrile:H 2 0 (0.1 % TFA) 4 mL/min, 1 x 0.2 mL injections). 350 uCi of product was 
obtained which had a radiochemical purity of 98.4 % by HPLC (Zorbax XDB C8 column, 4.6 x 
150 mm, 70:30 acetonitrile:H 2 0 (0.1 % TFA), 1 mL/min, t K = 10.4 min) and coeluted with an 
authentic sample of 2. LC/MS m/z = 508 (product - glucuronide - H 2 0), SA = 91 1 Ci/mmol. 

[0309] Following the same general procedure for synthesis of 35 S-2, except omitting the 
radiolabelling, compounds 2 and iv can be prepared. 
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[0310] Preparation of brush border membrane vesicles (BBMV). Membranes were 
prepared from Rhesus macaque (Macaca mulatta), rat (male Sprague-Dawley), and mouse (male 
C57BL/6J) intestines, using Mg** precipitation method described in the following references and 
with modifications described below (Hauser, H., Howell, K., Dawson, R.M.C., Bowyer, D. E. 
Biochim. Biophys. Acta 602, 567-577 (1980); Kramer, W., Girbig, F., Gutjahr, U., Kowalewski, 
S., Jouvenal, K., Muller, G., Tripier, D., Wess, G. J. Biol. Chem. 268, 18035-18046 (1993); 
Rigtrup, K.M., Ong, D.E. Biochemistry 31, 2920-2926 (1992)). 

[0311] The intestines from freshly sacrificed animals were cut into segments, perfused with ice- 
cold saline buffer (Buffer A: 26 mM NaHC0 3 , 0.96 mM NaH 2 P0 4) 5 mM HEPES, 5.5 mM 
glucose, 117 mM NaCl, 5.4 mM KC1, pH = 7.4), placed on cold glass plates, opened 
longitudinally, and the mucosa scraped with glass microscope slips. This mucosa could be used 
fresh or frozen with identical results. To prepare the membranes, the mucosal scrapings were 
resuspended in 20 volumes of cold buffer consisting of 300 mM D-mannitol, 5 mM EGTA, 12 
mM Tris, pH 7.4 with HC1, and containing 0.1 mM PMSF and a 1% dilution of a protease 
inhibitor cocktail (set 1 , Calbiochem). They were homogenized using a Polytron at medium 
speed on ice until inspection with a microscope indicated complete cell lysis. Then, solid MgCh 
was added slowly with stirring to a final concentration of 10 mM, and the solution was kept 
stirring on ice for 15 min. Cellular debris was removed by centrifugation for 15 min at 3,000g, 
and the membranes were recovered by centrifugation for 60 min at 48,000g. The membranes 
were further rinsed by re-suspension in a buffer containing 50 mM D-mannitol, 5 mM EGTA, 
and 2 mM Tris at pH 7.40, and centrifugation for 60 min at 48,000g. The final pellet was 
resuspended in 120 mM NaCl and 20 mM Tris at pH 7.40 to a concentration of -10-20 mg 



NY02:471952.1 



-106- 



P36134 074669.0105 



protein/ml, aliquoted, frozen in liquid nitrogen, and stored at -80C. The activity was stable 
indefinitely and could be freeze-thawed with minimal loss. 

[0312] Membrane protein was measured by the Bradford assay (Bradford, M.M. Anal. Biochem. 
72, 248-254 (1976)) using bovine serum albumin as standard. The enrichment in brush border 
membranes was assessed using gamma-glutamyltransferase as a marker enzyme (Kramer, W., 
Girbig, F., Gutjahr, U., Kowalewski, S., Jouvenal, K., Muller, G., Tripier, D., Wess, G. J. Biol. 
Chem. 268, 18035-18046 (1993)), which indicated a 6-fold enrichment over the initial 
homogenate. 

[0313] Binding assay. Assays were conducted in 12 x 75 mm glass test tubes and total volume 
100 ul. In general, frozen membranes were diluted in buffer A to a final concentration of 0.02 to 
5 mg/ml. Radiolabeled ligands were typically 25-50 nM for 3 H-ezetimibe (EZE)glucuronide 1 
(200,000-400,000 dpm), and 3-5 nM for its 35 S analog 2, (300,000-1,000,000 dpm) in the assay, 
and they were delivered as DMSO or CH3CN solutions. Competing ligands were likewise added 
as DMSO solutions to give a total 2-10 % organic solvent content. Nonspecific binding was 
defined by competition with 100 uM ezetimibe glucuronide. At least 2 components of buffer A, 
the bicarbonate and phosphate salts, were later found to be inconsequential and were routinely 
omitted. To ensure equilibrium was established, the reactions with compound 1 were incubated 
at least 3 hours for rhesus membranes and at least one hour for rat membranes at room 
temperature, and with compound 2 as long as 2 hours at 37°C with both rhesus and rat 
membranes. 
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[0314] Bound ligand was quantified by single-tube vacuum filtration using GF/C glass fiber 
filters. Glass fiber filters (GF/C) were obtained from Whatman. The filters were pretreated by 
soaking with 0.5% polyethylenimine to reduce nonspecific binding. Filtration was accomplished 
by adding 2.5 ml of ice cold buffer (120 mM NaCl, 0.1% sodium cholate, and 20 mM MES at 
pH 6.70) to the assay tube, pouring the mixture through the filter, and then rinsing the tube and 
filter twice more with another 2 x 2.5 ml buffer. The filters were counted in 7 ml vials using 
Packard DM liquid scintillation fluid. Where triplicate assays were performed, the standard error 
was typically <4%. As an example, assay of rat brush border membranes at 2 mg/ml in the 
presence of 400,000 dpm (50 nM) 3 H-ezetimibe glucuronide gave 15,000 dpm specific and 3,000 
dpm nonspecific binding. The filters contributed most of the nonspecific binding (2,000 dpm). 

[0315] Alternatively, vacuum filtration of compound 2 on a 96-well plate (Whatman GF/C) can 
also be used to achieve adequate precision. 

[0316] Data analysis. Data from saturation experiments were subjected to a Scatchard analysis, 
and linear regression was performed to yield the equilibrium dissociation constant (Kd) and 
maximum receptor concentration (B ma x)- Correlation coefficients for these determinations were 
typically greater than 0.96. Data from competition experiments were analyzed and IC50 values 
determined from Hill plots of the binding data. The kinetic data for ligand association and 
dissociation were subjected to the analysis of Weiland and Molinoff (Weiland, G, Molinoff, .B. 
Life Sci. 29, 313-330 (1981)). The dissociation rate constant for (k_i) was determined directly 
for a first order plot of ligand dissociation versus time. The rate of ligand association (ki) was 
determined from the equation ki = kob S ([LRe]/([L] [LR]ma X )) where [L] is the concentration of 
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ligand, [LRe] is the concentration of the complex at equilibrium, [LR] max is the maximum 
number of receptors present, and ko bs is the slope of the pseudo-first order plot L„ ([LE]e/([LR] e - 
[LR] t )) versus time. 

[0317] Binding analysis. Ezetimibe is rapidly converted to its glucuronide in vivo, and this 
metabolite is thought to be largely if not exclusively responsible for inhibition of cholesterol 
uptake. Accordingly, both 3 H-ezetimibe and its corresponding glucuronide derivative (1) were 
prepared and tested for binding to intestinal brush border membrane preparations, using a single- 
tube vacuum filtration technique. As a result of the hydrophobic nature of 3 H-ezetimibe, high 
nonspecific binding was observed, precluding its use as a radioligand for the binding assay. 
However, due to the improved physical properties of the glucuronide derivative (1), specific 
binding was observed with this radioligand and it was used to assess binding in rhesus, rat, and 
mouse intestinal brush border membranes. 

[0318] Rhesus, rat, and mouse intestinal scrapings were homogenized and the brush border 
membranes isolated. Specific binding was observed exclusively in the membrane fraction. Plots 
of total, nonspecific, and specific binding to rhesus (Figure 1) and rat (Figure 2) brush border 
membranes. Aliquots of rhesus BBMV (83 pg/assay) or rat BBMV (250ug/assay) were 
incubated with increasing concentrations of 3 H-EZE-glucuronide. Total binding and nonspecific 
binding determined in the presence of 10-100 uM EZE-glucuronide are shown. Specific binding 
was calculated from the difference between total and nonspecific binding. Data were fit by 
nonlinear regression as described above, and the linear Scatchard plot is shown. In rhesus 
membranes, the data correspond to a single binding site with = 41 nM and a concentration of 
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5.5 pmol/mg membrane protein. The affinity is ~10-fold lower in rat membranes (K<j = 540 nM). 
3 H-EZE-glucuronide was not useful as a ligand for a binding assay for the mouse target due to 
the compounds low affinity in mouse membrane. These potencies correlate roughly with the 
sensitivity of these species to ezetimibe inhibition of cholesterol uptake. 

[0319] Rate constants for binding and dissociation. Ezetimibe-glucuronide is slow to bind, 
and forms a relatively long-lived complex with its receptor. Indeed, this was key to detecting the 
interaction in a traditional filter-binding assay, as ligand/receptor interactions with K<j values 
greater than 100 nM often go unrecognized because of the typical fast off-rates of the ligands. 
Rate constants for association (ko n ) and dissociation (koff) of compound 1 were determined for rat 
and rhesus membranes, and used as an alternative method to calculate the dissociation constant 
(IQ) according to the relationship IQ = kofi/kon. 300ug/assay of rat brush border membrane 
vesicles were incubated with 25nM 3 H-EZE-glucuronide at room temperature for up to three 
hours for the association kinetic studies. 83 ug/assay of rhesus brush border membrane vesicles 
were incubated with 25nM 3 H-EZE-glucuronide at room temperature for up to five hours for the 
association kinetic studies. Nonspecific binding measured in the presence of lOOuM EZE- 
glucuronide was substracted from the total binding to calculate the specific binding shown in 
Figures 3 A and 4A. For the dissociation kinetic study, rat brush border membrane vesicles were 
incubated with 25nM 3 H-EZE-glucuronide for 2 hours at room temperature and ligand 
dissociation was initiated by the addition of lOOuM EZE-glucuronide. Rhesus brush border 
membrane vesicles were incubated with 42nM 3 H-EZE-glucuronide for 4 hours at room 
temperature and ligand dissociation was initiated by the addition of lOOuM EZE-glucuronide. 
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For both rat and rhesus dissociation studies, samples were collected at various times and 
radiolabel was detected. Dissociation curves are shown in Figures 3B (rat) and 4B (rhesus). 

[0320] For rat membranes, the rate constant for association is ko n = 5,540 M" 1 s" 1 (compared to 
10 8 tolO 9 M" 1 s" 1 for diffusion controlled encounter), and the rate constant for dissociation is kofr 
= 2.4 x 10" 3 s" 1 , corresponding to a half-life of 4.7 min. The data are shown in Figure 3, where 
the solid lines are theoretical for these rate constants. The value predicted from these rate 
constants (K<j = koff/ko n = 440 nM) agrees well with that measured at equilibrium (K^ = 540 nM). 

[0321] For rhesus membranes, where H-ezetimibe glucuronide is at least 10-fold more potent 
(as described above), the association rate remains the same but the half-life for dissociation of 
the complex increases to ~90 min. These data are shown in Figure 4, where the theoretical lines 
correspond to ko n = 3,900 M' 1 s' 1 and koff = 1.23 x 10" 4 s"\ and predict K<j = 32 nM compared to 
that measured at equilibrium (K<i = 41 nM). 

Example 28: Binding Analysis of a Potent NPC1L1 ligand 

[0322] A 35 S-labeled propargyl-sulfonamide analogue of ezetimibe glucuronide ( 35 S-2) was 
identified as a potential NPC1L1 antagonist. Compound 2 was prepared as described in 
Example 27 and found to have markedly improved affinity for some species of brush border 
membranes vesicles. For rhesus brush border membranes vesicles, 56 \ig protein/assay were 
incubated with 25 nM 3 H-EZE-glucuronide in the presence of increasing concentrations of EZE- 
glucuronide and 2. For rat brush border membranes vesicles, 150 p.g protein/assay were 
incubated with 50 nM 3 H-EZE-glucuronide in the presence of increasing concentrations of EZE- 
glucuronide and 2. For mouse brush border membranes vesicles, 20 ug protein/assay were 
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incubated with 3 nM S-2 in the presence of increasing concentrations of EZE-glucuronide and 
2. 

[0323] 2 is more potent against enterocyte brush border membrane preparations from rats (35- 
fold), but is equipotent with ezetimibe glucuronide for rhesus membrane preparations (Figure 5, 
Table 8). It also has enhanced affinity for mouse membranes (Figure 6, Table 8). 

Table 8. Summary of dissociation constants (Kd) for binding of ezetimibe glucuronide 1 
and its propargyl-sulfonamide derivative 2 to rhesus, rat, and mouse intestinal brush 



border membranes. 



Compound 


Rhesus 


Rat 


Mouse 


1 


39 


530 


2,300 


2 


38 


15 


144 











Kd values are nM. 

Example 29: Distribution of 3 H-ezetimibe glucuronide (1) binding to intestinal tissues. 
[0324] Previous studies have established that cholesterol absorption occurs primarily in the 
jejunum, and is substantially lower in the ileum and duodenum. To determine if the binding 
activity is similarly distributed, the binding assay using 3 H-ezetimibe glucuronide ( 3 H-1) as a 
radioligand was used to determine the distribution of binding sites in sections from rhesus and rat 
intestines. 

[0325] For the rhesus studies, 10 cm corresponding to the ileum of a rhesus small intestine was 
separated and the remaining intestine was divided into three segments, (proximal, middle and 
distal) of equal length (70 cm each). For the rat studies, 10 cm corresponding to the ileum of a 
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rhesus small intestine was separated and the remaining intestine was divided into three segments, 
(proximal, middle and distal) of equal length (36 cm each). Brush border membrane vesicles 
were prepared as described in Example 27. Aliquots of vesicles (100-200ug) protein/assay were 
incubated with 50nM H-EZE-glucuronide in the absence and presence of lOOuM EZE- 
glucuronide. 

[0326] As shown in Figure 7, specific binding for H-ezetimibe glucuronide peaks in the 
jejunum in both species, consistent with the previously observed pattern of cholesterol 
absorption. 

Example 30: Correlation of in vitro and in vivo binding activity of NPC1L1 
[0327] To determine if in vitro binding activity is predictive of in vivo efficacy, the enantiomer 
of ezetimibe glucuronide and several close structural analogues of ezetimibe glucuronide that 
were tested in the rat membrane binding assay were tested in an acute rat cholesterol absorption 
study as described in Examples 23-26. The selected analogs had a range of in vitro potencies, 
and were anticipated to have similar physical properties to ezetimibe glucuronide (Table 9). The 
enantiomer, which has a Kd > 100,000 nM for the rat target, was inactive in the in vivo assay. 
For the other analogs, the same rank order of potency is observed in the in vitro and in vivo 
assays, further evidence that the observed binding is due to the target of ezetimibe. 
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Table 9. IC 5 o values of EZE-gluc and analogs to inhibit binding of 3H-EZE-gluc to rat 
brush border membrane vesicles. 



Compound Name 


R 


X 


Y 


RAT IC 50 
(nM) 


3 


H 


H2 


F 


2,300 


EZE-gluc 1 


H 


(S)-OH 


F 


530 


EZE-gluc enantiomer 4 


H 


(R)-OH 


F 


>1 00000 


5 


H 


(R)-OH 


F 


3,900 


6 


H 


=0 


F 


70,000 


7 


OH 


(S)-OH 


F 


252 




Y 

Backbone structure for compounds 1, 3, 5, 6 and 7. 




Y 



Backbone structure for compound 4. 
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Example 31: Binding affinities of ezetimibe glucuronide and its analogs to recombinant 
NPC1L1 

[0328] NPC1L1 was identified as a candidate target of ezetimibe from a search of genetic 
databases for cholesterol binding motifs. Subsequently, NPC1L1 deficient mice were found to 
have 80% reduction of cholesterol absorption, and did not respond to ezetimibe treatment, 
strongly suggesting that this protein is required for ezetimibe efficacy. To determine if NPC1L1 
is the direct target of ezetimibe, binding affinities were compared for ezetimibe glucuronide and 
several analogs to NPC1L1 transfected cells and rat brush border membrane vesicles. 

[0329] Rat NPC1L1 transfected CHO cells (-500,000 cells/assay) were incubated with 5 nM 
35 S-2 (~1 million dpm/assay) for 2 hours at 37 °C in the absence or presence of increasing 
concentrations of EZE-glucuronide (compound 1), compounds 2, 3, 5, 6, or 8. 

[0330] Human NPC1L1 transfected CHO cells (-600,000 cells/assay) were incubated with 5 
nM 35 S-2 (~1 million dpm/assay) in buffer A for 2 hours at 37 °C in the absence or presence of 
increasing concentrations of EZE-glucuronide (compound 1), compounds 2, 3, 5, 6, or 8. 

[0331] As shown in Figure 8 and Table 10, the affinities for the recombinant and native proteins 
are virtually identical, providing compelling evidence that NPC1L1 is the direct target of 
ezetimibe in rodent tissues, and that other proteins are not required for binding. 

[0332] Affinities of ezetimibe glucuronide and analogues thereof were also determined for 
human recombinant NPC1L1. The results, shown in Figure 9, indicate that ezetimibe 
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glucuronide (1) has an affinity for the human protein of 907 nM. The propargyl-sulfonamide 
analogue (2) is approximately 50-fold more potent, with a K<j = 21 nM, suggesting that this 
compound has the potential for enhanced potency of cholesterol absorption inhibition in man. 
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Table 10: Comparison of dissociation constants (Kd) for binding to native rat intestinal 
brush border membranes and membranes from rat NPC1L1 transfected cells. 



Analog 


Recombinant rat NPC1L1 
Kd, nM 


Rat BBMV 
Kd, nM 


EZE-glucuronide (1) 


790 


600 


2 


12 


15 


3 


2400 


2300 


6 


84500 


70000 


5 


5800 


3900 


8 


556 


818 



Example 32: Binding of S-2 to membranes from wild type and NPC1L1 deficient mice. 
[0333] Final confirmation that NPC1L1 is the target of ezetimibe was provided by binding 
studies with 35 S-2 in intestinal brush border membranes from NPC1L1 deficient and control 
mice. 

[0334] Brush border membranes vesicles were prepared from intestinal tissues of wild type and 
NPC1L1 knockout (-/-) mice. 15, 30 and 60ug protein/ assay of brush border membranes 
vesicles were incubated with 4nM 35 S-2 in buffer A for 3 hours at 37°C in the presence and 
absence of lOOuM EZE-glucuronide. 
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[0335] The results, shown in Figure 10, indicate that no detectable binding is observed in 
membranes from NPC1L1 deficient mice, whereas age matched wild type control membranes 
have detectable binding. The binding affinity observed in this experiment in control mouse 
membranes (K<j=156 nM) was virtually identical to that observed in previous studies (Figure 1 1). 

[0336] The present invention is not to be limited in scope by the specific embodiments described 
herein. Indeed, various modifications of the invention in addition to those described herein will 
become apparent to those skilled in the art from the foregoing description. Such modifications 
are intended to fall within the scope of the appended claims. 

[0337] Patents, patent applications, publications, product descriptions, Genbank Accession 
Numbers and protocols are cited throughout this application, the disclosures of which are 
incorporated herein by reference in their entireties for all purposes. 
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I Claim: 

1. A method for identifying a ligand of NPC1L1 comprising: 

contacting NPC1L1 with a detectably labeled substituted 2-azetidinone and a candidate 
compound; and 

determining whether said candidate compound binds to NPC1L1 ; 

wherein binding of said compound to NPC1L1 modulates binding of said detectably labeled 
substituted 2-azetidinone to NPC1L1 and 

wherein said modulation indicates that the candidate compound is a ligand that binds to 
NPC1L1. 

2. The method of claim 1, wherein NPC1L1 has an amino acid sequence selected 
from the group consisting of SEQ ED NO: 2,' SEQ ID NO: 4, SEQ ID NO: 12 and a fragment 
thereof. 

3. The method of claim 1, wherein binding of said substituted 2-azetidinone to 
NPC1L1 is disrupted. 

4. The method of claim 1, wherein NPC1L1 is membrane-bound. 

5. The method of claim 4, wherein the membrane is isolated from mammalian 

intestines. 

6. The method of claim 5, wherein the membrane is a brush border membrane. 

7. The method of claim 6, wherein the brush border membrane is from the group 
consisting of rhesus, rat, murine, and human intestinal tissue. 

8. The method of claim 4, wherein the membrane is vesicularized. 

9. The method of claim 4, wherein the membrane is isolated from cultured cells. 
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10. The method of claim 9, wherein the membrane is isolated from recombinant 
NPC1 LI -expressing cells. 

11. The method of claim 1, wherein NPC1L1 is solubilized. 

12. The method of claim 11, wherein NPC1L1 is isolated from membranes 
derived from mammalian intestines. 

13. The method of claim 12, wherein the membranes are brush border 

membranes. 

14. The method of claim 13, wherein the brush border membranes are from the 
group consisting of rhesus, rat, murine, and human intestinal tissue. 

15. The method of claim 1, wherein the detectably labeled substituted 2- 
azetidinone is selected from the group consisting of 3 H, 35 S, l25 I, and a fluorescently labeled 
substituted 2-azetidinone. 

16. The method of claim 1, further comprising incubating NPC1L1 with said 
detectably labeled substituted 2-azetidinone and said candidate compound. 

17. The method of claim 15, wherein the binding of said detectably labeled 
substituted 2-azetidinone to NPC1L1 is detected by liquid scintillation counting. 

18. The method of claim 17, wherein NPC1L1 bound to detectably labeled 
substituted 2-azetidinone is collected onto filters by vacuum filtration. 

19. The method of claim 17, wherein the binding of said detectably labeled 
substituted 2-azetidinone to NPC1L1 is detected using a scintillation proximity assay. 

20. The method of claim 1, wherein said detectably labeled substituted 2- 
azetidinone is a substituted 2-azetidinone-glucuronide. 
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21. The method of claim 20, wherein the substituted 2-azetidinone-glucuronide is 
selected from the group consisting of ezetimibe-glucuronide and compound 2. 

22. The method of claim 21, wherein said substituted 2-azetidinone-glucuronide 
is ezetimibe-glucuronide. 

23. The method of claim 21, wherein said substituted 2-azetidinone-glucuronide 
is compound 2. 

24. A method for identifying a compound that inhibits intestinal sterol or 5a- 
stanol absorption, wherein said absorption is mediated by NPC1L1, comprising 

contacting NPC1L1 with a detectably labeled ligand and a candidate compound; and 
determining whether said candidate compound binds to NPC1L1 ; 
wherein binding of said candidate compound to NPC1L1 modulates binding of said ligand to 
NPC1L1, 

wherein said modulation indicates that the candidate compound is an intestinal sterol or 5a- 
stanol absorption inhibitor. 

25. The method of claim 24, wherein the sterol is selected from the group 
consisting of cholesterol, sitosterol, campesterol, stigmasterol, avenosterol and a cholesterol 
oxidation product, and the 5a-stanol is selected from the group consisting of cholestanol, 5a- 
campesterol, and 5a-sitostanol. 

26. A method for identifying a ligand of NPC1 LI comprising: 
contacting NPC1L1 with a detectably labeled substituted 2-azetidinone; and 
measuring binding of NPC1L1 with the detectably labeled substituted 2-azetidinone in 

the presence and absence of a candidate compound; 
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wherein decreased binding of the detectably labeled substituted 2-azetidinone to the NPC1L1 
in the presence of the candidate compound indicates that said candidate compound is a ligand of 
NPC1L1. 

27. The method of claim 26, wherein NPC1L1 has an amino acid sequence 
selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 12 and a 
fragment thereof. 

28. The method of claim 27, wherein NPC1L1 is membrane-bound. 

29. The method of claim 28, wherein the membrane is isolated from mammalian 

intestines. 

30. The method of claim 29, wherein the membrane is a brush border 

membranes. 

3 1 . The method of claim 30, wherein the brush border membrane is from the 
group consisting of rhesus, rat, murine, and human intestinal tissue. 

32. The method of claim 28, wherein the membrane is vesicularized. 

33. The method of claim 28, wherein the membrane is isolated from cultured 

cells. 

34. The method of claim 33, wherein the membrane is isolated from recombinant 
NPC1 LI -expressing cells. 

35. The method of claim 27, wherein NPC1L1 is solubilized. 

36. The method of claim 35, wherein NPC1L1 is isolated from membranes 
derived from mammalian intestines. 
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37. The method of claim 36, wherein the membranes are brush border 

membranes. 

38. The method of claim 37, wherein the brush border membranes are derived 
from the group consisting of rhesus, rat, murine, and human intestinal tissue. 

39. The method of claim 26, wherein the detectably labeled substituted 2- 
azetidinone is selected from the group consisting of 3 H, 35 S, 125 I, and a fluorescently labeled 
substituted 2-azetidinone. 

40. The method of claim 26, further comprising incubating NPC1L1 with said 
detectably labeled substituted 2-azetidinone and said candidate compound. 

41. The method of claim 39, wherein the binding of said detectably labeled 
substituted 2-azetidinone to NPC1L1 is detected by liquid scintillation counting. 

42. The method of claim 41, wherein NPC1L1 bound to detectably labeled 
substituted 2-azetidinone is collected onto filters by vacuum filtration. 

43. The method of claim 41, wherein the binding of said detectably labeled 
substituted 2-azetidinone-glucuronide to NPC1L1 is detected using a scintillation proximity 
assay. 

44. The method of claim 26, wherein said detectably labeled substituted 2- 
azetidinone is a substituted 2-azetidinone-glucuronide. 

45. The method of claim 44, wherein the substituted 2-azetidinone-glucuronide is 
selected from the group consisting of ezetimibe-glucuronide and compound 2. 

46. The method of claim 45, wherein said detectably labeled substituted 2- 
azetidinone-glucuronide is ezetimibe-glucuronide. 
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47. The method of claim 45, wherein said detectably labeled substituted 2- 
azetidinone-glucuronide is compound 2. 

48. A method for identifying a compound that inhibits intestinal sterol or 5a- 
stanol absorption wherein said absorption is mediated by NPC1L1 comprising 

contacting NPC1L1 with a detectably labeled ligand and the candidate compound; 

measuring binding of NPC1L1 with the detectably labeled ligand in the presence 
and absence of a candidate compound; 

wherein binding of said candidate compound to NPC1L1 modulates binding of said 
ligand to NPC1L1, 

wherein said binding indicates that the candidate compound is an intestinal steroid or 5 a- 
steroid absorption inhibitor. 

49. The method of claim 48, wherein the sterol is selected from the group 
consisting of cholesterol, sitosterol, campesterol, stigmasterol, avenosterol and a cholesterol 
oxidation product, and the 5a-stanol is selected from the group consisting of cholestanol, 5a- 
campesterol, and 5a-sitostanol. 
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ABSTRACT OF THE DISCLOSURE 
The present invention provides human, rat and mouse NPC1L1 polypeptides and 
polynucleotides encoding the polypeptides. Methods for detecting ligands which bind to 
NPC1L1 and block intestinal cholesterol absorption are provided. Also included is a method of 
identifying ligands which bind to NPC1L1 using membranes derived from brush border 
membrane preparations. Compounds that bind to NPC1L1 can be used for inhibiting intestinal 
cholesterol absorption in a subject. 
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SEQUENCE LISTING 



<110> Garcia-Calvo, Margarita 

<120> NPC1L1 (NPC3) AND METHODS OF IDENTIFYING LIGANDS THEREOF 

<130> P36134' 074669.0105 
<140> 

<141> 2004-01-16 

<160> 51 



<170> Patentln. version 3.1 



<210> 1 

<211> 3996 

<212> DNA 

<213> Rattus sp. 



<220> 

<221> CDS 

<222> (1) . . (3996) 

<223> 



<400> 1 
atg gca get 
Met Ala Ala 
1 

gcg gec cag 
Ala Ala Gin 



acc ttt tac 
Thr Phe Tyr 
35 

acg tea eta 
Thr Ser Leu 



gee tgg ctg 
Ala Trp Leu 
5 

ggt gag eta 
Gly Glu Leu 
20 

gaa gag tgc 
Glu Glu Cys 



tec aat gta 
Ser Asn Val 



gga tgg ctg 
Gly Trp Leu 



tac aca ccc 
Tyr Thr Pro 
25 

ggg aaa aac 
Gly Lys Asn 
40 

tec tgc ctg 
Ser Cys Leu 



etc tgg gee 
Leu Trp Ala 
10 

aaa cac gaa 
Lys His Glu 



cca gag etc 
Pro Glu Leu 



tct aac acc 
Ser Asn Thr 



ctg etc ctg 
Leu Leu Leu 
15 

get ggg gtc 
Ala Gly Val 
30 

tct gga ggc 
Ser Gly Gly 
45 

ccg gee cgc 
Pro Ala Arg 



age 48 
Ser 



tgc 96 
Cys 



etc 144 
Leu 



cac 192 
His 



50 



55 



60 



gtc acg ggt gaa cac ctg get ctt etc cag cgc ate tgt ccc cgc ctg 240 
Val Thr Gly Glu His Leu Ala Leu Leu Gin Arg lie Cys Pro Arg Leu 
65 70 75 80 

tac aac ggc ccc aat ace act ttt gee tgt tgc tct acc aag cag ctg 288 
Tyr Asn Gly Pro Asn Thr Thr Phe Ala Cys Cys Ser Thr Lys Gin Leu 
85 90 95 ' 

ctg tec tta gaa age age atg tec ate acc aag gee ctt etc acg cgc 336 
Leu Ser Leu Glu Ser Ser Met Ser lie Thr Lys Ala Leu Leu Thr Arg 
100 105 " 110 

tgc ccg gee tgc tct gac aat ttt gtg age tta cac tgc cac aac act 384 
Cys Pro Ala Cys Ser Asp Asn Phe Val Ser Leu His Cys His Asn Thr 
115 120 125 

tgc age cct gac cag age etc ttc ate aac gtc acc egg gtg gtt gag 432 
Cys Ser Pro Asp Gin Ser Leu Phe lie Asn Val Thr Arg Val Val Glu 
130 135 140 

egg ggc get gga gag cct cct gee gtg gtg gee tat gag gec ttt tat 480 
Arg Gly Ala Gly Glu Pro Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 

cag cgc age ttt get gag aag gee tat gag tec tgc age cag gtg cgc 528 
Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys Ser Gin Val Arg 
165 170 175 

ate cct gcg gee get tec ttg gee gtg ggc age atg tgt gga gtg tat 576 
lie Pro Ala Ala Ala Ser Leu Ala Val Gly Ser Met Cys Gly Val Tyr 
180 185 190 

ggc tec gee etc tgc aat get cag cgc tgg etc aac ttc caa gga gac 624 
Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 

aca ggg aat ggc ctg get ccg ctg gat ate acc ttc cac etc ttg gag 672 
Thr Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu 
210 215 220 

cct ggc cag gee eta ccg gat ggg ate cag cca ctg aat ggg aag ate 720 
Pro Gly Gin Ala Leu Pro Asp Gly lie- Gin Pro Leu Asn Gly Lys lie 
225 230 235 240 

gca ccc tgc aac gag tct cag ggt gat gac tea gca gtc tgc tec tgc 768 
Ala Pro Cys Asn Glu Ser Gin Gly Asp Asp Ser Ala Val Cys Ser Cys 
245 250 255 

cag gac tgt gcg gcg tec tgc cct gtc ate cct ccg ccc gag gec ttg 816 
Gin Asp Cys Ala Ala Ser Cys Pro Val lie Pro Pro Pro Glu Ala Leu 
260 265 270 

cgc cct tec ttc tac atg ggt cgc atg cca ggc tgg ctg gec etc ate 864 
Arg Pro Ser Phe Tyr Met Gly Arg Met Pro Gly Trp Leu Ala Leu lie 
275 280 285 

ate ate ttc act get gtc ttt gtg ttg etc tct gca gtc ctt gtg cgt 912 



He He Phe Thr Ala Val Phe Val Leu Leu Ser Ala Val Leu Val Arg 
290 295 300 



etc cga gtg 
Leu Arg Val 
305 

gaa gec ccc 
Glu Ala Pro 



ctg ggc egg 
Leu Gly Arg 



etc ace gtc 
Leu Thr Val 
355 

ggc ctg acc 
Gly Leu Thr 
370 

gec ccc aag 
Ala Pro Lys 
385 

ttc ggc ccc 
Phe Gly Pro 



agg tec age 
Arg Ser Ser 



agt ggg ate 
Ser Gly He 
435 

gag agg ctt 
Glu Arg Leu 
450 

ate tec etc 
He Ser Leu 
465 

age etc tec 
Ser Leu Ser 



aac cgc acc 
Asn Arg Thr 



acc tec ctg 
Thr Ser Leu 
515 



gtt tec aac 
Val Ser Asn 
310 

aaa etc cct 
Lys Leu Pro 
325 

ttc ttc cag 
Phe Phe Gin 
340 

tta gca ctg 
Leu Ala Leu 



ttt att gaa 
Phe He Glu 



age cag gec 
Ser Gin Ala 
390 

ttc ttt cga 
Phe Phe Arg 
405 

tac aag tac 
Tyr Lys Tyr 
420 

ctg tec ctg 
Leu Ser Leu 



cga cac ctg 
Arg His Leu 



cag gac ate 
Gin Asp He 
470 

gac tgc tgt 
Asp Cys Cys 
485 

etc ctg atg 
Leu Leu Met 
500 

gtg gac tgg 
Val Asp Trp 



agg aac aag 
Arg Asn Lys 



cat aag cac 
His Lys His 



aac tgg ggc 
Asn Trp Gly 
345 

tec ttc ate 
Ser Phe He 
360 

etc acc aca 
Leu Thr Thr 
375 

egg aaa gag 
Arg Lys Glu 



acc aac cag 
Thr Asn Gin 



gac tec eta 
Asp Ser Leu 
425 

gac ttc ctg 
Asp Phe Leu 
440 

caa gtg tgg 
Gin Val Trp 
455 

tgc tat gee 
Cys Tyr Ala 



gtc aac age 
Val Asn Ser 



etc acg gee 
Leu Thr Ala 
505 

aag gac cat 
Lys Asp His 
520 



aac aag gca 
Asn Lys Ala 
315 

aaa etc tea 
Lys Leu Ser 
330 

aca agg gtg 
Thr Arg Val 



gtt gtg ata 
Val Val He 



gac cct gtg 
Asp Pro Val 
380 

aag tct ttc 
Lys Ser Phe 
395 

att ttc gtg 
He Phe Val 
410 

ctg eta ggg 
Leu Leu Gly 



ctg gag ctg 
Leu Glu Leu 



tec cct gag 
Ser Pro Glu 
460 

ccc etc aac 
Pro Leu Asn 
475 

etc ctt cag 
Leu Leu Gin 
490 

aac cag act 
Asn Gin Thr 



ttc etc tac 
Phe Leu Tyr 



gaa ggc ccc 
Glu Gly Pro 



ccc cat acc 
Pro His Thr 
335 

gec teg tgg 
Ala Ser Trp 
350 

gee tta gca 
Ala Leu Ala 
365 

gaa ctg tgg 
Glu Leu Trp 



cat gat gag 
His Asp Glu 



aca get egg 
Thr Ala Arg 
415 

tec aag aac 
Ser Lys Asn 
430 

ctg gag ctt 
Leu Glu Leu 
445 

gca gag cgc 
Ala Glu Arg 



cca tat aac 
Pro Tyr Asn 



tac ttc cag 
Tyr Phe Gin 
495 

ctg aat ggc 
Leu Asn Gly 
510 

tgt gca aat 
Cys Ala Asn 
525 



cag 960 

Gin 

320 

ate 1008 
He 



cca 1056 
Pro 



gca 1104 
Ala 



teg 1152 
Ser 



cat 1200 

His 

400 

aac 1248 
Asn 



ttc 1296 
Phe 



cag 1344 
Gin 



aac 1392 
Asn 



acc 1440 

Thr 

480 

aac 1488 
Asn 



cag 1536 
Gin 



gec 1584 
Ala 



cct etc acg 
Pro Leu Thr 
530 

gac tac ggg 
Asp Tyr Gly 
545 

ggc acg gac 
Gly Thr Asp 



aat aac tac 
Asn Asn Tyr 



gag gag get 
Glu Glu Ala 
595 

gac aag ttc 
Asp Lys Phe 
610 

ate aac cgc 
lie Asn Arg 
625 

att ate gtc 
He He Val 



tgc age cga 
Cys Ser Arg 



gtg att gtt 
Val He Val 
67 5 

tac ctg ggt 
Tyr Leu Gly 
690 

ctg gtg eta 
Leu Val Leu 
705 

tac cag agg 
Tyr Gin Arg 



ggc cgc acc 
Gly Arg Thr 



tct gag gec 
Ser Glu Ala 
755 



ttc aaa gat 
Phe Lys Asp 



get cct gtc 
Ala Pro Val 
550 

tat tec gag 
Tyr Ser Glu 
565 

ccc get gat 
Pro Ala Asp 
580 

ttc ttg aag 
Phe Leu Lys 



cag gtt gcg 
Gin Val Ala 



acc acc ate 
Thr Thr He 
630 

ttc ctg tac 
Phe Leu Tyr 
645 

gta gcg gtg 
Val Ala Val 
660 

gtg ctg gga 
Val Leu Gly 



gtc ccc tct 
Val Pro Ser 



get gtg gga 
Ala Val Gly 
710 

eta cct agg 
Leu Pro Arg 
725 

ctg ggc agt 
Leu Gly Ser 
740 

ate tgc ttc 
He Cys Phe 



ggc acg tct 
Gly Thr Ser 
535 

ttc ccc ttc 
Phe Pro Phe 



gca gaa gcg 
Ala Glu Ala 



gat ccc cgc 
Asp Pro Arg 
585 

gaa atg gaa 
Glu Met Glu 
600 

ttc tea get 
Phe Ser Ala 
615 

cag gac ctg 
Gin Asp Leu 



ate tec ctg 
He Ser Leu 



gag tec aag 
Glu Ser Lys 
665 

gca gtt ctg 
Ala Val Leu 
680 

tct ctg gtt 
Ser Leu Val 
695 

get .gac aac 
Ala Asp Asn 



atg cct ggg 
Met Pro Gly 



gtg gee ccc 
Val Ala Pro 
745 

ttt eta ggg 
Phe Leu Gly 
760 



ctg gec ctg 
Leu Ala Leu 
540 

ctt get gtt 
Leu Ala Val 
555 

ctg ate ata 
Leu He He 
570 

atg gec cag 
Met Ala Gin 



tec ttc cag 
Ser Phe Gin 



gag cgc tct 
Glu Arg Ser 
620 

cct gtc ttt 
Pro Val Phe 
635 

gee ctg ggc 
Ala Leu Gly 
650 

get act ctg 
Ala Thr Leu 



get gec atg 
Ala Ala Met 



ate ate caa 
He He Gin 
700 

ate ttc ate 
He Phe He 
715 

gaa cag cga 
Glu Gin Arg 
730 

age atg ctg 
Ser Met Leu 



gee ctg acc 
Ala Leu Thr 



age tgc atg 
Ser Cys Met 



ggg gga tac 
Gly Gly Tyr 



acc ttc tct 
Thr Phe Ser 
575 

gee aag etc 
Ala Lys Leu 
590 

agg aac aca 
Arg Asn Thr 
605 

ctg gag gat 
Leu Glu Asp 



gee gtc age 
Ala Val Ser 



age tac tec 
Ser Tyr Ser 
655 

ggc eta ggt 
Gly Leu Gly 
670 

ggc ttc tac 
Gly Phe Tyr 
685 

gtg gta cct 
Val Val Pro 



ttt gtt ctt 
Phe Val Leu 



gag get cac 
Glu Ala His 
735 

ctg tgc age 
Leu Cys Ser 
750 

ccc atg cca 
Pro Met Pro 
765 



get 1632 
Ala 



caa 1680 

Gin 

560 

etc 1728 
Leu 



tgg 1776 
Trp 



agt 1824 
Ser 



gag 1872 
Glu 



tac 1920 

Tyr 

640 

aga 1968 
Arg 



ggg 2016 
Gly 



tec 2064 
Ser 



ttc 2112 
Phe 



gag 2160 

Glu 

720 

att 2208 
He 



etc 2256 
Leu 



get 2304 
Ala 



gtg agg acc 
Val Arg Thr 
770 

ctg etc cag 
Leu Leu Gin 
785 

agg cag gag 
Arg Gin Glu 



aag ctg ccc 
Lys Leu Pro 



cgc aag ata 
Arg Lys lie 
835 

gtg atg ctg 
Val Met Leu 
850 

tgc aac ate 
Cys Asn lie 
865 

teg tac ttg 
Ser Tyr Leu 



ggg cct cca 
Gly Pro Pro 



gag gca ggc 
Glu Ala Gly 
915 

tec eta acc 
Ser Leu Thr 
930 

tac gtg get 
Tyr Val Ala 
945 

ctg acc ccg 
Leu Thr Pro 



aag gat gag 
Lys Asp Glu 



ttc gee ttg 
Phe Ala Leu 



atg act gec 
Met Thr Ala 
790 

gee tct cgc 
Ala Ser Arg 
805 

cca cct aaa 
Pro Pro Lys 
820 

tac get cct 
Tyr Ala Pro 



ctg ttt ctg 
Leu Phe Leu 



aac gtg ggg 
Asn Val Gly 
870 

ata gac tac 
lie Asp Tyr 
885 

gtg tac ttt 
Val Tyr Phe 
900 

atg aac gee 
Met Asn Ala 



cag aaa ate 
Gin Lys lie 



att get gca 
He Ala Ala 
950 

tec tec tec 
Ser Ser Ser 
965 

ttc tgt ccc 
Phe Cys Pro 
980 



acc tct ggc 
Thr Ser Gly 
775 

ttt gtg gee 
Phe Val Ala 



ccg gat gtc 
Pro Asp Val 



gaa aaa gaa 
Glu Lys Glu 
825 

ttc ctg ctg 
Phe Leu Leu 
840 

acc ctg ttt 
Thr Leu Phe 
855 

eta gac cag 
Leu Asp Gin 



ttc etc ttt 
Phe Leu Phe 



gtc acc acc 
Val Thr Thr 
905 

act tgc tct 
Thr Cys Ser 
920 

cag tat gee 
Gin Tyr Ala 
935 

tec tec tgg 
Ser Ser Trp 



tgc tgt cgc 
Cys Cys Arg 



tea acg gat 
Ser Thr Asp 
985 



tta gca att 
Leu Ala He 
780 

ctg etc tec 
Leu Leu Ser 
795 

tta tgc tgc 
Leu Cys Cys 
810 

ggc etc tta 
Gly Leu Leu 



cac aga ttc 
His Arg Phe 



gga gca aat 
Gly Ala Asn 
860 

gag ctg get 
Glu Leu Ala 
875 

ctg aac cga 
Leu Asn Arg 
890 

teg ggc ttc 
Ser Gly Phe 



age gca ggc 
Ser Ala Gly 



agt gaa ttc 
Ser Glu Phe 
940 

gta gat gac 
Val Asp Asp 
955 

ctt tat ata 
Leu Tyr He 
970 

act tec ttc 
Thr Ser Phe 



ate etc gac 
He Leu Asp 



ctg gat age 
Leu Asp Ser 



ttt tea acc 
Phe Ser Thr 
815 

etc cgc ttc 
Leu Arg Phe 
830 

ate cgc cct 
He Arg Pro 
845 

etc tac tta 
Leu Tyr Leu 



ctg ccc aag 
Leu Pro Lys 



tac ctt gaa 
Tyr Leu Glu 
895 

aac ttc tec 
Asn Phe Ser 
910 

tgt aag age 
Cys Lys Ser 
925 

cct gac cag 
Pro Asp Gin 



ttc ate gac 
Phe He Asp 



cgt ggc ccc 
Arg Gly Pro 
975 

aac tgc tta 
Asn Cys Leu 
990 



ttc 2352 
Phe 



aag 2400 

Lys 

800 

egg 24 4 8 

Arg 



ttc 2496 
Phe 



gtt 2544 
Val 



atg 2592 
Met 



gac 2640 

Asp 

880 

gtg 2688 
Val 



age 2736 
Ser 



ttc 2784 
Phe 



tct 2832 
Ser 



tgg 2880 

Trp 

960 

cat 2928 
His 



aaa 2976 
Lys 



aac tgc atg aac cgc act ctg ggt cct gtg agg ccc aca gcg gaa cag 3024 
Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Ala Glu Gin 



995 



1000 



1005 



ttt cat aag tac ctg ccc 
Phe His Lys Tyr Leu Pro 
1010 

aga tgt ccc aaa ggg ggt 
Arg Cys Pro Lys Gly Gly 
1025 

ttg age tea gat ggc cag 
Leu Ser Ser Asp Gly Gin 
1040 

cac aag ccc tta agg aac 
His Lys Pro Leu Arg Asn 
1055 

gcg tec egg ttg eta gca 
Ala Ser Arg Leu Leu Ala 
1070 

gtg cct ggg aca gat cca 
Val Pro Gly Thr Asp Pro 
1085 

tec aac gtg ttc tac cag 
Ser Asn Val Phe Tyr Gin 
1100 

ate ttc ace ctt get ctt 
lie Phe Thr Leu Ala Leu 
1115 

tac etc eta ctg ggc ctg 
Tyr Leu Leu Leu Gly Leu 
1130 

etc tec ate att atg att 
Leu Ser He He Met He 
1145 

gtg tgg ggt ate age tat 
Val Trp Gly He Ser Tyr 
1160 

acg gca gtg ggc atg tct 
Thr Ala Val Gly Met Ser 
1175 

tec ttt get gta age ace 
Ser Phe Ala Val Ser Thr 
1190 

gat get act gtc ttc atg 
Asp Ala Thr Val Phe Met 
1205 

atg ace aac ttc cca ggc 



tgg ttc ctg aat gat 
Trp Phe Leu Asn Asp 
1015 

eta gca gcg tat aga 
Leu Ala Ala Tyr Arg 
1030 

gtt ata gee tec cag 
Val He Ala Ser Gin 
1045 

tea cag gac ttc aca 
Ser Gin Asp Phe Thr 
1060 

gee aac ate aca get 
Ala Asn lie Thr Ala 
1075 

aac ttt gag gtc ttc 
Asn Phe Glu Val Phe 
1090 

caa tac ctg acg gtc 
Gin Tyr Leu Thr Val 
1105 

tgc ttt gtg ccc ace 
Cys Phe Val Pro Thr 
1120 

gac atg tgc tea ggg 
Asp Met Cys Ser Gly 
1135 

etc gtg gac acc att 
Leu Val Asp Thr He 
1150 

aat gcg gta tec etc 
Asn Ala Val Ser Leu 
1165 

gtg gag ttt gtg tec 
Val Glu Phe Val Ser 
1180 

aag cct acc egg ctg 
Lys Pro Thr Arg Leu 
1195 

ggc agt gcg gtg ttt 
Gly Ser Ala Val Phe 
1210 

ate etc ate ttg ggc 



ccg ccc aat ate 3069 

Pro Pro Asn He 

1020 

acg tct gtg aat 3114 

Thr Ser Val Asn 

1035 

ttc atg gee tac 3159 

Phe Met Ala Tyr 

1050 

gaa get etc egg 3204 

Glu Ala Leu Arg 

1065 

gac eta egg aag 3249 

Asp Leu Arg Lys 

1080 

cct tac acg ate 3294 

Pro Tyr Thr He 

1095 

ctt cct gag gga 3339 

Leu Pro Glu Gly 

1110 

ttt gtt gtc tgc 3384 

Phe Val Val Cys 

1125 

ate etc aac eta 3429 

He Leu Asn Leu 

1140 

ggc etc atg get 3474 

Gly Leu Met Ala 

1155 

ate aac ctt gtc -3519 

He Asn Leu Val 

1170 

cac ate act egg 3564 

His lie Thr Arg 

1185 

gag agg get aaa 3609 

Glu Arg Ala Lys 

1200 

get gga gtg gee 3654 

Ala Gly Val Ala 

1215 

ttt gee caa gee 3699 



Met Thr Asn Phe Pro Gly lie Leu lie Leu Gly Phe Ala Gin Ala 
1220 1225 1230 



cag ctt att cag ate ttc ttc ttc cgc etc aac ctt ctg ate ace 3744 
Gin Leu He Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr 
1235 1240 1245 

ttg ctg ggt ctg ctg cat ggc ctg gtc ttc ctg ccg gtt gtc etc 3789 
Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 

age tat ctg gga cca gat gtt aac caa get ctg gta cag gag gag 3834 
Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Gin Glu Glu 
1265 1270 1275 

aaa eta gee age gag gca gca gtg gee cca gag cct tct tgc cca 3879 
Lys Leu Ala Ser Glu Ala Ala Val Ala Pro Glu Pro Ser Cys Pro 
1280 1285 1290 

cag tac ccc tec cct get gat gcg gat gee aat gtt aac tac ggc 3924 
Gin Tyr Pro Ser Pro Ala Asp Ala Asp Ala Asn Val Asn Tyr Gly 
1295 1300 1305 

ttt gee cca gaa ctt gee cac gga get aat get get aga age tct 3969 
Phe Ala Pro Glu Leu Ala His Gly Ala Asn Ala Ala Arg Ser Ser 
1310 1315 1320 

ttg ccc aaa agt gac caa aag ttc taa 3996 
Leu Pro Lys Ser Asp Gin Lys Phe 
1325 1330 



<210> 2 

<211> 1331 

<212> PRT 

<213> Rattus sp. 

<400> 2 

Met Ala Ala Ala Trp Leu Gly Trp Leu Leu Trp Ala Leu Leu Leu Ser 
15 10 15 



Ala Ala Gin Gly Glu Leu Tyr Thr Pro Lys His Glu Ala Gly Val Cys 
20 25 30 



Thr Phe Tyr Glu Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Gly Leu 
35 40 45 



Thr Ser Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg His 
50 55 60 



Val Thr Gly Glu His Leu Ala Leu Leu Gin Arg lie Cys Pro Arg Leu 
65 70 75 80 



Tyr Asn Gly Pro Asn Thr Thr Phe Ala Cys Cys Ser Thr Lys Gin Leu 
85 90 95 



Leu Ser Leu Glu Ser Ser Met Ser lie Thr Lys Ala Leu Leu Thr Arg 
100 105 110 



Cys Pro Ala Cys Ser Asp Asn Phe Val Ser Leu His Cys His Asn Thr 
115 120 125 



Cys Ser Pro Asp Gin Ser Leu Phe He Asn Val Thr Arg Val Val Glu 
130 135 140 



Arg Gly Ala Gly Glu Pro Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 



Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys Ser Gin Val Arg 
165 170 ~ 175 



He Pro Ala Ala Ala Ser Leu Ala Val Gly Ser Met Cys Gly Val Tyr 
180 135 * 190 



Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 



Thr Gly Asn Gly Leu Ala Pro Leu Asp He Thr Phe His Leu Leu Glu 
210 215 220 



Pro Gly Gin Ala Leu Pro Asp Gly He Gin Pro Leu Asn Gly Lys He 
225 230 235 240 



Ala Pro Cys Asn Glu Ser Gin Gly Asp Asp Ser Ala Val Cys Ser Cys 
245 250 255 



Gin Asp Cys Ala Ala Ser Cys Pro Val He Pro Pro Pro Glu Ala Leu 
260 265 270 



Arg Pro Ser Phe Tyr Met Gly Arg Met Pro Gly Trp Leu Ala Leu He 
275 280 " 285 



He lie Phe Thr Ala Val Phe Val Leu Leu Ser Ala Val Leu Val Arg 
290 295 300 



Leu Arg Val Val Ser Asn Arg Asn Lys Asn Lys Ala Glu Gly Pro Gin 
305 310 315 320 



Glu Ala Pro Lys Leu Pro His Lys His Lys Leu Ser Pro His Thr lie 
325 330 335 



Leu Gly Arg Phe Phe Gin Asn Trp Gly Thr Arg Val Ala Ser Trp Pro 
340 345 350 



Leu Thr Val Leu Ala Leu Ser Phe He Val Val He Ala Leu Ala Ala 
355 360 365 



Gly Leu Thr Phe He Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 



Ala Pro Lys Ser Gin Ala Arg Lys Glu Lys Ser Phe His Asp Glu His 
385 390 395 ~ 400 



Phe Gly Pro Phe Phe Arg Thr Asn Gin He Phe Val Thr Ala Arg Asn 
405 410 415 



Arg Ser Ser Tyr Lys Tyr Asp Ser Leu Leu Leu Gly Ser Lys Asn Phe 
420 425 430 



Ser Gly He Leu Ser Leu Asp Phe Leu Leu Glu Leu Leu Glu Leu Gin 
435 440 445 



Glu Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Glu Arg Asn 
450 455 460 



He Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro Tyr Asn Thr 
465 470 475 ' 480 



Ser Leu Ser Asp Cys Cys Val Asn Ser Leu Leu Gin Tyr Phe Gin Asn 
485 490 ~ 495 



Asn Arg Thr Leu Leu Met Leu Thr Ala Asn Gin Thr Leu Asn Gly Gin 
500 505 510 



Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 525 



Pro Leu Thr Phe Lys Asp Gly Thr Ser Leu Ala Leu Ser Cys Met Ala 



530 



535 



540 



Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala Val Gly Gly Tyr Gin 
545 550 555 560 



Gly Thr Asp Tyr Ser Glu Ala Glu Ala Leu lie He Thr Phe Ser Leu 
565 570 575 



Asn Asn Tyr Pro Ala Asp Asp Pro Arg Met Ala Gin Ala Lys Leu Trp 
580 585 590 



Glu Glu Ala Phe Leu Lys Glu Met Glu Ser Phe Gin Arg Asn Thr Ser 
595 600 605 



Asp Lys Phe Gin Val Ala Phe Ser Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 



He Asn Arg Thr Thr He Gin Asp Leu Pro Val Phe Ala Val Ser Tyr 
625 630 635 640 



He He Val Phe Leu Tyr He Ser Leu Ala Leu Gly Ser Tyr Ser Arg 
645 650 655 



Cys Ser Arg Val Ala Val Glu Ser Lys Ala Thr Leu Gly Leu Gly Gly 
660 665 " 670 



Val He Val Val Leu Gly Ala Val Leu Ala Ala Met Gly Phe Tyr Ser 
675 680 685 



Tyr Leu Gly Val Pro Ser Ser Leu Val He He Gin Val Val Pro Phe 
690 695 700 



Leu Val Leu Ala Val Gly Ala Asp Asn He Phe He Phe Val Leu Glu 
705 710 715 720 



Tyr Gin Arg Leu Pro Arg Met Pro Gly Glu Gin Arg Glu Ala His He 
725 730 735 



Gly Arg Thr Leu Gly Ser Val Ala Pro Ser Met Leu Leu Cys Ser Leu 
740 745 750 



Ser Glu Ala He Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala 
755 760 765 



Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala lie lie Leu Asp Phe 
770 775 780 



Leu Leu Gin Met Thr Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 800 



Arg Gin Glu Ala Ser Arg Pro Asp Val Leu Cys Cys Phe Ser Thr Arg 
805 810 ' 815 



Lys Leu Pro Pro Pro Lys Glu Lys Glu Gly Leu Leu Leu Arg Phe Phe 
820 825 830 



Arg Lys lie Tyr Ala Pro Phe Leu Leu His Arg Phe lie Arg Pro Val 
835 840 845 



Val Met Leu Leu Phe Leu Thr Leu Phe Gly Ala Asn Leu Tyr Leu Met 
850 855 860 



Cys Asn lie Asn Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
865 870 ' 875 880 



Ser Tyr Leu lie Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Leu Glu Val 
885 890 * 895 



Gly Pro Pro Val Tyr Phe Val Thr Thr Ser Gly Phe Asn Phe Ser Ser 
900 905 910 



Glu Ala Gly Met Asn Ala Thr Cys Ser Ser Ala Gly Cys Lys Ser Phe 
915 920 925 



Ser Leu Thr Gin Lys lie Gin Tyr Ala Ser Glu Phe Pro Asp Gin Ser 
930 935 940 



Tyr Val Ala lie Ala Ala Ser Ser Trp Val Asp Asp Phe lie Asp Trp 
945 950 955 ' 960 



Leu Thr Pro Ser Ser Ser Cys Cys Arg Leu Tyr lie Arg Gly Pro His 
965 970 " 975 



Lys Asp Glu Phe Cys Pro Ser Thr Asp Thr Ser Phe Asn Cys Leu Lys 
980 985 990 



Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Ala Glu Gin 
995 1000 1005 



Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Pro Pro Asn lie 
1010 1015 1020 



Arg Cys Pro Lys Gly Gly Leu Ala Ala Tyr Arg Thr Ser Val Asn 
1025 1030 1035 



Leu Ser Ser Asp Gly Gin Val He Ala Ser Gin Phe Met Ala Tyr 
1040 1045 1050 



His Lys Pro Leu Arg Asn Ser Gin Asp Phe Thr Glu Ala Leu Arg 
1055 1060 1065 



Ala Ser Arg Leu Leu Ala Ala Asn He Thr Ala Asp Leu Arg Lys 
1070 1075 1080 



Val Pro Gly Thr Asp Pro Asn Phe Glu Val Phe Pro Tyr Thr He 
1085 1090 1095 



Ser Asn Val Phe Tyr Gin Gin Tyr Leu Thr Val Leu Pro Glu Gly 
1100 1105 1110 



He Phe Thr Leu Ala Leu Cys Phe Val Pro Thr Phe Val Val Cys 
1115 1120 1125 



Tyr Leu Leu Leu Gly Leu Asp Met Cys Ser Gly He Leu Asn Leu 
1130 1135 " 1140 



Leu Ser He He Met He Leu Val Asp Thr He Gly Leu Met Ala 
1145 1150 1155 



Val Trp Gly He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val 
1160 1165 1170 



Thr Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg 
1175 1180 1185 



Ser Phe Ala Val Ser Thr Lys Pro Thr Arg Leu Glu Arg Ala Lys 
1190 1195 ' 1200 



Asp Ala Thr Val Phe Met Gly Ser Ala Val Phe Ala Gly Val Ala 
1205 1210 1215 



Met Thr Asn Phe Pro Gly He Leu He Leu Gly Phe Ala Gin Ala 
1220 1225 1230 



Gin Leu lie Gin lie Phe Phe Phe Arg Leu Asn Leu Leu lie Thr 
1235 1240 1245 



Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 



Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Gin Glu Glu 
1265 1270 1275 



Lys Leu Ala Ser Glu Ala Ala Val Ala Pro Glu Pro Ser Cys Pro 
1280 1285 1290 



Gin Tyr Pro Ser Pro Ala Asp Ala Asp Ala Asn Val Asn Tyr Gly 
1295 1300 1305 



Phe Ala Pro Glu Leu Ala His Gly Ala Asn Ala Ala Arg Ser Ser 
1310 1315 1320 



Leu Pro Lys Ser Asp Gin Lys Phe 
1325 1330 
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<400> 3 

atg gcg gag gcc ggc ctg agg ggc tgg ctg ctg tgg gcc ctg etc ctg 48 
Met Ala Glu Ala Gly Leu Arg Gly Trp Leu Leu Trp Ala Leu Leu Leu 
15 10 15 

cgc ttg gcc cag agt gag cct tac aca acc ate cac cag cct ggc tac 96 
Arg Leu Ala Gin Ser Glu Pro Tyr Thr Thr lie His Gin Pro Gly Tyr 
20 25 30 



tgc gcc ttc tat gac gaa tgt ggg aag aac cca gag ctg tct gga age 14 4 

Cys Ala Phe Tyr Asp Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Ser 
35 40 45 

etc atg aca etc tec aac gtg tec tgc ctg tec aac acg ccg gcc cgc 192 
Leu Met Thr Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg 
50 55 60 

aag ate aca ggt gat cac ctg ate eta tta cag aag ate tgc ccc cgc 240 
Lys lie Thr Gly Asp His Leu lie Leu Leu Gin Lys lie Cys Pro Arg 
65 70 75 80 

etc tac acc ggc ccc aac ace caa gcc tgc tgc tec gcc aag cag ctg 288 
Leu Tyr Thr Gly Pro Asn Thr Gin Ala Cys Cys Ser Ala Lys Gin Leu 
85 90 95 

gta tea ctg gaa gcg agt ctg teg ate acc aag gcc etc etc acc cgc 336 
Val Ser Leu Glu Ala Ser Leu Ser lie Thr Lys Ala Leu Leu Thr Arg 
100 105 110 

tgc cca gcc tgc tct gac aat ttt gtg aac ctg cac tgc cac aac acg 384 
Cys Pro Ala Cys Ser Asp Asn Phe Val Asn Leu His Cys His Asn Thr 
115 120 125 

tgc age ccc aat cag age etc ttc ate aat gtg acc cgc gtg gcc cag 432 
Cys Ser Pro Asn Gin Ser Leu Phe lie Asn Val Thr Arg Val Ala Gin 
130 135 140 

eta ggg get gga caa etc cca get gtg gtg gcc tat gag gcc ttc tac 480 
Leu Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 

cag cat age ttt gcc gag cag age tat gac tec tgc age cgt gtg cgc 528 
Gin His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg 
165 170 175 

gtc cct gca get gcc acg ctg get gtg ggc acc atg tgt ggc gtg tat 576 
Val Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr 
180 185 190 

ggc tct gcc ctt tgc aat gcc cag cgc tgg etc aac ttc cag gga gac 624 
Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 ' 205 

aca ggc aat ggt ctg gcc cca ctg gac ate acc ttc cac etc ttg gag 672 
Thr Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu 
210 215 220 

cct ggc cag gcc gtg ggg agt ggg att cag cct ctg aat gag ggg gtt 720 
Pro Gly Gin Ala Val Gly Ser Gly He Gin Pro Leu Asn Glu Gly Val 
225 230 235 240 

gca cgt tgc aat gag tec caa ggt gac gac gtg gcg acc tgc tec tgc 768 
Ala Arg Cys Asn Glu Ser Gin Gly Asp Asp Val Ala Thr Cys Ser Cys 
245 " 250 255 

caa gac tgt get gca tec tgt cct gcc ata gcc cgc ccc cag gcc etc 816 
Gin Asp Cys Ala Ala Ser Cys Pro Ala He Ala Arg Pro Gin Ala Leu 
260 265 270 



gac tec acc 
Asp Ser Thr 
275 

ate ate etc 
lie lie Leu 
290 

ttc cgt gtg 
Phe Arg Val 
305 

aag ggc acc 
Lys Gly Thr 



ctt ggc cag 
Leu Gly Gin 



ctg acc ate 
Leu Thr lie 
355 

ggc ctg gtc 
Gly Leu Val 
370 

gee ccc aac 
Ala Pro Asn 
385 

ttc ggc ccc 
Phe Gly Pro 



egg tec age 
Arg Ser Ser 



age gga ate 
Ser Gly He 
435 

gag agg ctg 
Glu Arg Leu 
450 

ate tec ctg 
He Ser Leu 
465 

agt etc tac 
Ser Leu Tyr 



aac cgc acg 
Asn Arg Thr 



ttc tac ctg 
Phe Tyr Leu 



tgc tct gtc 
Cys Ser Val 



gee ccc gee 
Ala Pro Ala 
310 

age etc tct 
Ser Leu Ser 
325 

ttc ttc cag 
Phe Phe Gin 
340 

ttg gtg eta 
Leu Val Leu 



ttt aca gaa 
Phe Thr Glu 



age caa gee 
Ser Gin Ala 
390 

ttc ttc cga 
Phe Phe Arg 
405 

tac agg tat 
Tyr Arg Tyr 
420 

ctg gac ctg 
Leu Asp Leu 



egg cac etc 
Arg His Leu 



cag gac ate 
Gin Asp He 
470 

gac tgc tgc 
Asp Cys Cys 
485 

etc ctg ctg 
Leu Leu Leu 



ggc cag atg 
Gly Gin Met 
280 

ttc get gtg 
Phe Ala Val 
295 

agg gac aaa 
Arg Asp Lys 



gac aag etc 
Asp Lys Leu 



ggc tgg ggc 
Gly Trp Gly 
345 

tct gtc ate 
Ser Val He 
360 

etc act acg 
Leu Thr Thr 
375 

egg agt gag 
Arg Ser Glu 



acc aac cag 
Thr Asn Gin 



gac tct ctg 
Asp Ser Leu 
425 

gac ttg ctg 
Asp Leu Leu 
440 

cag gta tgg 
Gin Val Trp 
455 

tgc tac gee 
Cys Tyr Ala 



ate aac age 
lie Asn Ser 



etc aca gec 
Leu Thr Ala 



ccg ggc agt 
Pro Gly Ser 



gtc acc ate 
Val Thr He 
300 

age aag atg 
Ser Lys Met 
315 

age ttc tec 
Ser Phe Ser 
330 

acg tgg gtg 
Thr Trp Val 



ccg gtg gtg 
Pro Val Val 



gac ccc gtg 
Asp Pro Val 
380 

aaa get ttc 
Lys Ala Phe 
395 

gtg ate ctg 
Val He Leu 
410 

ctg ctg ggg 
Leu Leu Gly 



ctg gag ctg 
Leu Glu Leu 



teg ccc gaa 
Ser Pro Glu 
4 60 

ccc etc aat 
Pro Leu Asn 
475 

etc ctg cag 
Leu Leu Gin 
490 

aac cag aca 
Asn Gin Thr 



ctg gtc etc 
Leu Val Leu 
285 

ctg ctt gtg 
Leu Leu Val 



gtg gac ccc 
Val Asp Pro 



acc cac acc 
Thr His Thr 
335 

get teg tgg 
Ala Ser Trp 
350 

gee ttg gca 
Ala Leu Ala 
365 

gag ctg tgg 
Glu Leu Trp 



cat gac cag 
His Asp Gin 



acg get cct 
Thr Ala Pro 
415 

ccc aag aac 
Pro Lys Asn 
430 

eta gag ctg 
Leu Glu Leu 
445 

gca cag cgc 
Ala Gin Arg 



ccg gac aat 
Pro Asp Asn 



tat ttc cag 
Tyr Phe Gin 
495 

ctg atg ggg 
Leu Met Gly 



ate 864 
He 



gga 912 
Gly 



aag 960 

Lys 

320 

etc 1008 
Leu 



cct 1056 
Pro 



gcg 1104 
Ala 



teg 1152 
Ser 



cat 1200 

His 

400 

aac 1248 
Asn 



ttc 1296 
Phe 



cag 1344 
Gin 



aac 1392 
Asn 



acc 1440 

Thr 

480 

aac 1488 
Asn 



cag 1536 
Gin 



500 



505 



510 



acc tec caa 
Thr Ser Gin 
515 

ccg etc acc 
Pro Leu Thr 
530 

gac tac ggg 
Asp Tyr Gly 
545 

gga aag gac 
Gly Lys Asp 



aac aat tac 
Asn Asn Tyr 



gag gag gec 
Glu Glu Ala 
595 

ggc atg ttc 
Gly Met Phe 
610 

ate aat cgc 
lie Asn Arg 
625 

att gtc ata 
He Val He 



tgg age cga 
Trp Ser Arg 



gtg gec gtg 
Val Ala Val 
675 

tac ttg ggt 
Tyr Leu Gly 
690 

ctg gtg ctg 
Leu Val Leu 
705 

tac cag agg 
Tyr Gin Arg 



ggg cga gec 



gtc gac tgg 
Val Asp Trp 



ttc aag gat 
Phe Lys Asp 



gee cct gtc 
Ala Pro Val 
550 

tat tct gag 
Tyr Ser Glu 
565 

cct gee ggg 
Pro Ala Gly 
580 

ttc tta gag 
Phe Leu Glu 



cag gtc acg 
Gin Val Thr 



acc aca get 
Thr Thr Ala 

630 

ttc ctg tac 
Phe Leu Tyr 
645 

gtg atg gtg 
Val Met Val 
660 

gtc ctg gga 
Val Leu Gly 



ate cgc tec 
He Arg Ser 



tec gtg ggg 
Ser Val Gly 
710 

ctg ccc egg 
Leu Pro Arg 
725 

eta ggc agg 



aag gac cat 
Lys Asp His 
520 

ggc aca gee 
Gly Thr Ala 
535 

ttc ccc ttc 
Phe Pro Phe 



gca gag gee 
Ala Glu Ala 



gac ccc cgt 
Asp Pro Arg 
585 

gaa atg cga 
Glu Met Arg 
600 

ttc acg get 
Phe Thr Ala 
615 

gaa gac ctg 
Glu Asp Leu 



ate tct ctg 
He Ser Leu 



gac tec aag 
Asp Ser Lys 
665 

gca gtc atg 
Ala Val Met 
680 

tec ctg gtc 
Ser Leu Val 
695 

get gat aac 
Ala Asp Asn 



agg cct ggg 
Arg Pro Gly 



gtg get ccc 



ttt ctg tac 
Phe Leu Tyr 



ctg gee ctg 
Leu Ala Leu 
540 

ctt gee att 
Leu Ala He 
555 

ctg ate atg 
Leu He Met 
570 

ctg gee cag 
Leu Ala Gin 



gee ttc cag 
Ala Phe Gin 



gag cgc tct 
Glu Arg Ser 
620 

ccc ate ttt 
Pro He Phe 

635 

gec ctg ggc 
Ala. Leu Gly 
650 

gee acg ctg 
Ala Thr Leu 



get gec atg 
Ala Ala Met 



ate ctg caa 
He Leu Gin 
700 

ate ttc ate 
He Phe He 
715 

gag cca cga 
Glu Pro Arg 
730 

age atg ctg 



tgt gee aat 
Cys Ala Asn 
525 

age tgc atg 
Ser Cys Met 



ggg ggg tac 
Gly Gly Tyr 



acg ttc tec 
Thr Phe Ser 
575 

gee aag ctg 
Ala Lys Leu 
590 

cgt egg atg 
Arg Arg Met 
605 

ctg gaa gac 
Leu Glu Asp 



gec acc age 
Ala Thr Ser 



age tat tec 
Ser Tyr Ser 
655 

ggc etc ggc 
Gly Leu Gly 
670 

ggc ttc ttc 
Gly Phe Phe 
685 

gtg gtt cct 
Val Val Pro 



ttt gtt etc 
Phe Val Leu 



gag gtc cac 
Glu Val His 
735 

ttg tgc age 



gec 1584 
Ala 



get 1632 
Ala 



aaa 1680 

Lys 

560 

etc 1728 
Leu 



tgg 1776 
Trp 



get 1824 
Ala 



gag 1872 
Glu 



tac 1920 

Tyr 

640 

age 1968 
Ser 



ggg 2016 
Gly 



tec 2064 
Ser 



ttc 2112 
Phe 



gag 2160 

Glu 

720 

att 2208 
He 



etc 2256 



Gly Arg Ala 



tct gag gcc 
Ser Glu Ala 
755 

gtg egg acc 
Val Arg Thr 
770 

etc ctg cag 
Leu Leu Gin 
785 

agg cag gag 
Arg Gin Glu 



gag ctg ccc 
Glu Leu Pro 



caa aag get 
Gin Lys Ala 
835 

gtg ctg ctg 
Val Leu Leu 
850 

tgc cac ate 
Cys His lie 
865 

teg tac ctg 
Ser Tyr Leu 



ggg gcc ccg 
Gly Ala Pro 



gag get ggg 
Glu Ala Gly 
915 

tec ttc acc 
Ser Phe Thr 
930 

tac ctg gcc 
Tyr Leu Ala 
945 

ctg acc ccg 
Leu Thr Pro 



Leu Gly Arg 
740 

ate tgc ttc 
lie Cys Phe 



ttt gcc ctg 
Phe Ala Leu 



atg tea gcc 
Met Ser Ala 
790 

gcc tec egg 
Ala Ser Arg 
805 

ccg cct ggc 
Pro Pro Gly 
820 

tat gcc ccc 
Tyr Ala Pro 



ctg ttt etc 
Leu Phe Leu 



age gtg gga 
Ser Val Gly 
870 

ctt gac tat 
Leu Asp Tyr 
885 

gtg tac ttt 
Val Tyr Phe 
900 

atg aat gcc 
Met Asn Ala 



cag aag ate 
Gin Lys He 



ate cct gcc 
He Pro Ala 
950 

tec tec tgc 
Ser Ser Cys 
965 



Val Ala Pro 
745 

ttc eta ggg 
Phe Leu Gly 
7 60 

acc tct ggc 
Thr Ser Gly 
775 

ttt gtg gcc 
Phe Val Ala 



ttg gac gtc 
Leu Asp Val 



cag gga gag 
Gin Gly Glu 
825 

ttc ctg ctg 
Phe Leu Leu 
840 

gcc ctg ttc 
Ala Leu Phe 
855 

ctg gac cag 
Leu Asp Gin 



ttc etc ttt 
Phe Leu Phe 



gtt acc acc 
Val Thr Thr 
905 

ate tgc tec 
He Cys Ser 
920 

cag tat gcc 
Gin Tyr Ala 
935 

tec tec tgg 
Ser Ser Trp 



tgc cgc ctt 
Cys Arg Leu 



Ser Met Leu 



gcc ctg acc 
Ala Leu Thr 



ctt gca gtg 
Leu Ala Val 
780 

ctg etc tec 
Leu Leu Ser 
795 

tgc tgc tgt 
Cys Cys Cys 
810 

ggg etc ctg 
Gly Leu Leu 



cac tgg ate 
His Trp He 



gga gtg age 
Gly Val Ser 
860 

gag ctg gcc 
Glu Leu Ala 
875 

ctg aac cgc 
Leu Asn Arg 
890 

ttg ggc tac 
Leu Gly Tyr 



agt gca ggc 
Ser Ala Gly 



aca gag ttc 
Thr Glu Phe 
940 

gtg gat gac 
Val Asp Asp 
955 

tat ata tct 
Tyr He Ser 
970 



Leu Cys Ser 
750 

ccc atg cca 
Pro Met Pro 
765 

ate ctt gac 
He Leu Asp 



ctg gac age 
Leu Asp Ser 



gtc aag ccc 
Val Lys Pro 
815 

ctt ggc ttc 
Leu Gly Phe 
830 

act cga ggt 
Thr Arg Gly 
845 

etc tac tec 
Leu Tyr Ser 



ctg ccc aag 
Leu Pro Lys 



tac ttc gag 
Tyr Phe Glu 
895 

aac ttc tec 
Asn Phe Ser 
910 

tgc aac aac 
Cys Asn Asn 
925 

cct gag cag 
Pro Glu Gin 



ttc att gac 
Phe lie Asp 



ggc ccc aat 
Gly Pro Asn 
975 



Leu 



get 2304 
Ala 



ttc 2352 
Phe 



aag 2400 

Lys 

800 

cag 24 4 8 

Gin 



ttc 2496 
Phe 



gtt 2544 
Val 



atg 2592 
Met 



gac 2640 

Asp 

880 

gtg 2688 
Val 



age 2736 
Ser 



ttc 2784 
Phe 



tct 2832 
Ser 



tgg 2880 

Trp 

960 

aag 2928 
Lys 



gac aag ttc tgc ccc teg acc gtc aac tct ctg aac tgc eta aag aac 
Asp Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn 
980 985 990 



2976 



tgc atg age ate acg atg ggc tct gtg agg ccc teg gtg gag cag ttc 
Cys Met Ser lie Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe 
995 1000 ~ 1005 



3024 



cat aag tat ctt ccc tgg ttc ctg aac gac egg ccc aac ate aaa 
His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn lie Lys 
1010 1015 * 1020 



3069 



tgt ccc aaa ggc ggc ctg gca gca tac age acc tct gtg aac ttg 
Cys Pro Lys Gly Gly Leu Ala Ala Tyr Ser Thr Ser Val Asn Leu 
1025 1030 1035 



3114 



act tea gat ggc cag gtt tta gec tec agg ttc atg gee tat cac 
Thr Ser Asp Gly Gin Val Leu Ala Ser Arg Phe Met Ala Tyr His 
1040 1045 1050 



3159 



aag ccc ctg aaa aac tea cag 
Lys Pro Leu Lys Asn Ser Gin 
1055 ' 1060 



gat tac aca gaa get 
Asp Tyr Thr Glu Ala 
1065 



ctg egg gca 
Leu Arg Ala 



3204 



get cga gag ctg gca gee aac ate act get gac ctg egg aaa gtg 
Ala Arg Glu Leu Ala Ala Asn lie Thr Ala Asp Leu Arg Lys Val 
1070 1075 1080 



3249 



cct gga aca gac ccg get ttt gag gtc ttc ccc tac acg ate acc 
Pro Gly Thr Asp Pro Ala Phe Glu Val Phe Pro Tyr Thr lie Thr 
1085 1090 1095 



3294 



aat gtg ttt tat gag cag tac 
Asn Val Phe Tyr Glu Gin Tyr 
1100 1105 



ctg acc ate etc cct 
Leu Thr lie Leu Pro 
1110 



gag ggg etc 
Glu Gly Leu 



3339 



ttc atg etc age etc tgc ctt gtg ccc acc ttc get gtc tec tgc 
Phe Met Leu Ser Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys 
1115 1120 1125 



3384 



etc ctg ctg ggc ctg gac ctg cgc tec ggc etc etc aac ctg etc 
Leu Leu Leu Gly Leu Asp Leu Arg Ser Gly Leu Leu Asn Leu Leu 
1130 1135 1140 



3429 



tec att gtc atg ate etc gtg gac act gtc ggc ttc atg gee ctg 
Ser lie Val Met lie Leu Val Asp Thr Val Gly Phe Met Ala Leu 
1145 1150 " 1155 



3474 



tgg gac ate agt tac aat get gtg tec etc ate aac ctg gtc teg 
Trp Asp lie Ser Tyr Asn Ala Val Ser Leu lie Asn Leu Val Ser 
1160 1165 1170 



3519 



gcg gtg ggc atg tct gtg gag ttt gtg tec cac att acc cgc tec 
Ala Val Gly Met Ser Val Glu Phe Val Ser His lie Thr Arg Ser 
1175 1180 1185 



3564 



ttt gee ate age acc aag ccc acc tgg ctg gag agg gee aaa gag 
Phe Ala lie Ser Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu 
1190 1195 1200 



3609 



gcc acc ate tct atg gga agt gcg gtg ttt gca ggt gtg gec atg 3654 
Ala Thr He Ser Met Gly Ser Ala Val Phe Ala Gly Val Ala Met 
1205 1210 1215 

acc aac ctg cct ggc ate ctt gtc ctg ggc etc gcc aag gcc cag 3699 
Thr Asn Leu Pro Gly He Leu Val Leu Gly Leu Ala Lys Ala Gin 
1220 1225 1230 

etc att cag ate ttc ttc ttc cgc etc aac etc ctg ate act ctg 3744 
Leu He Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr Leu 
1235 1240 1245 

ctg ggc ctg ctg cat ggc ttg gtc ttc ctg ccc gtc ate etc age 3789 
Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val He Leu Ser 
1250 1255 1260 

tac gtg ggg cct gac gtt aac ccg get ctg gca ctg gag cag aag 3834 
Tyr Val Gly Pro Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys 
1265 1270 1275 

egg get gag gag gcg gtg gca gca gtc atg gtg gcc tct tgc cca 3879 
Arg Ala Glu Glu Ala Val Ala Ala Val Met Val Ala Ser Cys Pro 
1280 1285 1290 

aat cac ccc tec cga gtc tec aca get gac aac ate tat gtc aac 3924 
Asn His Pro Ser Arg Val Ser Thr Ala Asp Asn He Tyr Val Asn 
1295 1300 1305 

cac age ttt gaa ggt tct ate aaa ggt get ggt gcc ate age aac 3969 
His Ser Phe Glu Gly Ser He Lys Gly Ala Gly Ala He Ser Asn 
1310 1315 1320 

ttc ttg ccc aac aat ggg egg cag ttc tga 3999 
Phe Leu Pro Asn Asn Gly Arg Gin Phe 
1325 1330 

<210> 4 

<211> 1332 

<212> PRT 

<213> Homo sapiens 



<400> 4 

Met Ala Glu Ala Gly Leu Arg Gly Trp Leu Leu Trp Ala Leu Leu Leu 
1 5 10 15 



Arg Leu Ala Gin Ser Glu Pro Tyr Thr Thr He His Gin Pro Gly Tyr 
20 25 30 



Cys Ala Phe Tyr Asp Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Ser 



35 



40 



45 



Leu Met Thr Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg 
50 55 60 



Lys lie Thr Gly Asp His Leu lie Leu Leu Gin Lys lie Cys Pro Arg 
65 70 75 80 



Leu Tyr Thr Gly Pro Asn Thr Gin Ala Cys Cys Ser Ala Lys Gin Leu 
85 .90 J 95 



Val Ser Leu Glu Ala Ser Leu Ser lie Thr Lys Ala Leu Leu Thr Arg 
100 105 110 



Cys Pro Ala Cys Ser Asp Asn Phe Val Asn Leu His Cys His Asn Thr 
115 120 125 



Cys Ser Pro Asn Gin Ser Leu Phe lie Asn Val Thr Arg Val Ala Gin 
130 135 140 



Leu Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 



Gin His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg 
165 170 175 



Val Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr 
180 185 190 



Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 



Thr Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu 
210 215 220 



Pro Gly Gin Ala Val Gly Ser Gly lie Gin Pro Leu Asn Glu Gly Val 
225 230 235 240 



Ala Arg Cys Asn Glu Ser Gin Gly Asp Asp Val Ala Thr Cys Ser Cys 
245 250 255 



Gin Asp Cys Ala Ala Ser Cys Pro Ala lie Ala Arg Pro Gin Ala Leu 
260 265 270 



Asp Ser Thr Phe Tyr Leu Gly Gin Met Pro Gly Ser Leu Val Leu lie 
275 280 285 



He He Leu Cys Ser Val Phe Ala Val Val Thr He Leu Leu Val Gly 
290 295 300 



Phe Arg Val Ala Pro Ala Arg Asp Lys Ser Lys Met Val Asp Pro Lys 
305 310 315 320 



Lys Gly Thr Ser Leu Ser Asp Lys Leu Ser Phe Ser Thr His Thr Leu 
325 330 335 



Leu Gly Gin Phe Phe Gin Gly Trp Gly Thr Trp Val Ala Ser Trp Pro 
340 345 350 



Leu Thr He Leu Val Leu Ser Val He Pro Val Val Ala Leu Ala Ala 
355 360 365 



Gly Leu Val Phe Thr Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 ~ 380 



Ala Pro Asn Ser Gin Ala Arg Ser Glu Lys Ala Phe His Asp Gin His 
385 390 395 400 



Phe Gly Pro Phe Phe Arg Thr Asn Gin Val He Leu Thr Ala Pro Asn 
405 410 415 



Arg Ser Ser Tyr Arg Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe 
420 425 ~ 430 



Ser Gly He Leu Asp Leu Asp Leu Leu Leu Glu Leu Leu Glu Leu Gin 
435 440 445 



Glu Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Gin Arg Asn 
450 455 460 



He Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro Asp Asn Thr 
465 470 475 ' 480 



Ser Leu Tyr Asp Cys Cys He Asn Ser Leu Leu Gin Tyr Phe Gin Asn 
485 490 495 



Asn Arg Thr Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Met Gly Gin 
500 505 510 



Thr Ser Gin Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 525 



Pro Leu Thr Phe Lys Asp Gly Thr Ala Leu Ala Leu Ser Cys Met Ala 
530 535 540 



Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala lie Gly Gly Tyr Lys 
545 550 555 560 



Gly Lys Asp Tyr Ser Glu Ala Glu Ala Leu lie Met Thr Phe Ser Leu 
565 570 575 



Asn Asn Tyr Pro Ala Gly Asp Pro Arg Leu Ala Gin Ala Lys Leu Trp 
580 585 590 



Glu Glu Ala Phe Leu Glu Glu Met Arg Ala Phe Gin Arg Arg Met Ala 
595 600 605 



Gly Met Phe Gin Val Thr Phe Thr Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 



lie Asn Arg Thr Thr Ala Glu Asp Leu Pro lie Phe Ala Thr Ser Tyr 
625 630 635 640 



lie Val lie Phe Leu Tyr lie Ser Leu Ala Leu Gly Ser Tyr Ser Ser 
645 650 655 



Trp Ser Arg Val Met Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly 
660 665 670 



Val Ala Val Val Leu Gly Ala Val Met Ala Ala Met Gly Phe Phe Ser 
675 680 685 



Tyr Leu Gly lie Arg Ser Ser Leu Val lie Leu Gin Val Val Pro Phe 
690 695 700 



Leu Val Leu Ser Val Gly Ala Asp Asn lie Phe lie Phe Val Leu Glu 
705 710 715 720 



Tyr Gin Arg Leu Pro Arg Arg Pro Gly Glu Pro Arg Glu Val His lie 
725 730 735 



Gly Arg Ala Leu Gly Arg Val Ala Pro Ser Met Leu Leu Cys Ser Leu 
740 745 750 



Ser Glu Ala He Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala 
755 760 765 



Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala Val He Leu Asp Phe 
770 775 780 



Leu Leu Gin Met Ser Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 ' 800 



Arg Gin Glu Ala Ser Arg Leu Asp Val Cys Cys Cys Val Lys Pro Gin 
805 810 815 



Glu Leu Pro Pro Pro Gly Gin Gly Glu Gly Leu Leu Leu Gly Phe Phe 
820 825 830 



Gin Lys Ala Tyr Ala Pro Phe Leu Leu His Trp He Thr Arg Gly Val 
835 840 845 



Val Leu Leu Leu Phe Leu Ala Leu Phe Gly Val Ser Leu Tyr Ser Met 
850 855 860 



Cys His He Ser Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
865 870 875 ' 880 



Ser Tyr Leu Leu Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Phe Glu Val 
885 890 895 



Gly Ala Pro Val Tyr Phe Val Thr Thr Leu Gly Tyr Asn Phe Ser Ser 
900 905 ' 910 



Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Asn Asn Phe 
915 920 925 



Ser Phe Thr Gin Lys He Gin Tyr Ala Thr Glu Phe Pro Glu Gin Ser 
930 935 940 



Tyr Leu Ala He Pro Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
945 950 955 960 



Leu Thr Pro Ser Ser Cys Cys Arg Leu Tyr He Ser Gly Pro Asn Lys 
965 970 * 975 



Asp Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn 



980 



985 



990 



Cys Met Ser lie Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe 
995 1000 1005 



His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn lie Lys 
1010 1015 ' 1020 



Cys Pro Lys Gly Gly Leu Ala Ala Tyr Ser Thr Ser Val Asn Leu 
1025 1030 1035 



Thr Ser Asp Gly Gin Val Leu Ala Ser Arg Phe Met Ala Tyr His 
1040 1045 1050 



Lys Pro Leu Lys Asn Ser Gin Asp Tyr Thr Glu Ala Leu Arg Ala 
1055 1060 1065 



Ala Arg Glu Leu Ala Ala Asn lie Thr Ala Asp Leu Arg Lys Val 
1070 1075 ~ 1080 



Pro Gly Thr Asp Pro Ala Phe Glu Val Phe Pro Tyr Thr lie Thr 
1085 1090 1095 



Asn Val Phe Tyr Glu Gin Tyr Leu Thr lie Leu Pro Glu Gly Leu 
1100 1105 1110 



Phe Met Leu Ser Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys 
1115 1120 1125 



Leu Leu Leu Gly Leu Asp Leu Arg Ser Gly Leu Leu Asn Leu Leu 
1130 1135 ' 1140 



Ser He Val Met He Leu Val Asp Thr Val Gly Phe Met Ala Leu 
1145 1150 1155 



Trp Asp He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val Ser 
1160 1165 1170 



Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg Ser 
1175 1180 1185 



Phe Ala He Ser Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu 
1190 1195 1200 



Ala Thr He Ser Met Gly Ser Ala Val Phe Ala Gly Val Ala Met 
1205 1210 1215 



Thr Asn Leu Pro Gly He Leu Val Leu Gly Leu Ala Lys Ala Gin 
1220 1225 1230 



Leu He Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr Leu 
1235 1240 1245 



Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val He Leu Ser 
1250 1255 1260 



Tyr Val Gly Pro Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys 
1265 1270 1275 



Arg Ala Glu Glu Ala Val Ala Ala Val Met Val Ala Ser Cys Pro 
1280 1285 1290 



Asn His Pro Ser Arg Val Ser Thr Ala Asp Asn He Tyr Val Asn 
1295 1300 1305 



His Ser Phe Glu Gly Ser He Lys Gly Ala Gly Ala He Ser Asn 
1310 1315 1320 



Phe Leu Pro Asn Asn Gly Arg Gin Phe 
1325 1330 



<210> 5 

<211> 885 

<212> DNA 

<213> Rattus sp. 

<400> 5 



ccacgcgtcc 


gcacctgcaa 


gtgtggtccc 


ctgaggcaga 


gcgcaacatc 


tccctccagg 


60 


acatctgcta 


tgcccccctc 


aacccatata 


acaccagcct 


ctccgactgc 


tgtgtcaaca 


120 


gcctccttca 


gtacttccag 


aacaaccgca 


ccctcctgat 


gctcacggcc 


aaccagactc 


180 


tgaatggcca 


gacctccctg 


gtggactgga 


aggaccattt 


cctctactgt 


gcaaatgccc 


240 


ctctcacgtt 


caaagatggc 


acgtctctgg 


ccctgagctg 


catggctgac 


tacggggctc 


300 


ctgtcttccc 


cttccttgct 


gttgggggat 


accaaggcac 


ggactattcc 


gaggcagaag 


360 



cgctgatcat 


aaccttctct 


ctcaataact 


accccgctga 


tgatccccgc 


atggcccagg 


420 


ccaagctctg 


ggaggaggct 


ttcttgaagg 


aaatggaatc 


cttccagagg 


aacacaagtg 


480 


acaagttcca 


ggttgcgttc 


tcagctgagc 


gctctctgga 


ggatgagatc 


aaccgcacca 


540 


ccatccagga 


cctgcctgtc 


tttgccgtca 


gctacattat 


cgtcttcctg 


tacatctccc 


600 


tggccctggg 


cagctactcc 


agatgcagcc 


gagtagcggt 


ggagtccaag 


gctactctgg 


660 


gcctaggtgg 


ggtgatagtg 


tgctgggagc 


agttctggct 


tgcatggggc 


ttctaactcc 


720 


tacctgggtg 


tcccctcttc 


tctggttatc 


atccaagtgg 


tacctttcct 


ggtgcttaag 


780 


ctgtgggagc 


tggacacatc 


tacatcctag 


acttgagtac 


cagaggtacc 


taggaagccg 


840 


cggaacagcg 


aaaaggacac 


attgggcgca 


ccctgggcat 


gtggc 




885 



<210> 6 

<211> 458 

<212> DNA 

<213> Rattus sp. 



<400> 6 
gaccagatgt 


taaccaagct 


ctggtacagg 


aggagaaact 


agccagcgag 


gcagcagtgg 


60 


ccccagagcc 


ttcttgccca 


cagtacccct 


cccctgctga 


tgcggatgcc 


aatgttaact 


120 


acggctttgc 


cccagaactt 


gcccacggag 


ctaatgctgc 


tagaagctct 


ttgcccaaaa 


180 


gtgaccaaaa 


gttctaatgg 


agtaggagct 


tgtccatgct 


tctgctgatg 


agggatcatg 


240 


aaggtcttcc 


ctctggttgt 


cctcaaggcc 


tggggggagg 


ttgttcagag 


aaaaatggct 


300 


ggcattcctg 


ccacgaggca 


accggcagct 


tggcactgac 


tccttggtct 


cataggtccc 


360 


taaggcttgg 


tcagattact 


cctcatggag 


agactatctt 


aagtatctaa 


gctatcgatt 


420 


gggatgcatc 


gctgttcatt 


aaaaaggcta 


tggctatg 






458 



<210> 7 

<211> 896 

<212> DNA 

<213> Rattus sp. 



<400> 7 

ccacgcgtcc gcagtttcat aagtacctgc cctggttcct gaatgatccg cccaatatca 



60 



gatgtcccaa 


agggggtcta 


gcagcgtata 


gaacgtctgt 


gaatttgagc 


tcagatggcc 


120 


aggttatagc 


ctcccagttc 


atggcctacc 


acaagccctt 


aaggaactca 


caggacttca 


180 


cagaagctct 


ccgggcgtcc 


cggttgctag 


cagccaacat 


cacagctgac 


ctacggaagg 


240 


tgcctgggac 


agatccaaac 


tttgaggtct 


tcccttacac 


gatctccaac 


gtgttctacc 


300 


agcaatacct 


gacggtcctt 


cctgagggaa 


tcttcaccct 


tgctctttgc 


tttgtgccca 


360 


cctttgttgt 


ctgctacctc 


ctactgggcc 


tggacatgtg 


ctcagggatc 


ctcaacctac 


420 


tctccatcat 


tatgattctc 


gtggacacca 


ttggcctcat 


ggctgtgtgg 


ggtatcagct 


480 


ataatgcggt 


atccctcatc 


aaccttgtca 


cggcagtggg 


catgtctgtg 


gagtttgtgt 


540 


cccacatcac 


tcggtccttt 


gcttgtaagc 


accaagccta 


cccggctgga 


gagggctaaa 


600 


agatgctact 


gtcttcatgg 


gcagtgcggt 


gtttgctgga 


gtggccatga 


ccaacttccc 


660 


aggcatcctc 


atcttggggg 


ctttgcccca 


agcccaggct 


tattcagatc 


ttcttcttcc 


720 


gcctcaacct 


tctgatcacc 


tttgctgggg 


tctgctgcat 


ggctggtctt 


cctgcccggt 


780 


ttgtcctcag 


ctatctggga 


ccagatgtaa 


ccaaggctct 


gctacccgga 


ggagaaacta 


840 


gccagcgagg 


gcagcagtgg 


ccccagagac 


ttcttgccca 


caagtaccct 


tccctg 


896 


<210> 8 














<211> 3124 












<212> DNA 














<213> Rattus sp. 












<400> 8 
tgcaagtgtg 


gtcccctgag 


gcagagcgca 


acatctccct 


ccaggacatc 


tgctatgccc 


60 


ccctcaaccc 


atataacacc 


agcctctccg 


actgctgtgt 


caacagcctc 


cttcagtact 


120 


tccagaacaa 


ccgcaccctc 


ctgatgctca 


cggccaacca 


gactctgaat 


ggccagacct 


180 


ccctggtgga 


ctggaaggac 


catttcctct 


actgtgcaaa 


tgcccctctc 


acgttcaaag 


240 


atggcacgtc 


tctggccctg 


agctgcatgg 


ctgactacgg 


ggctcctgtc 


ttccccttcc 


300 


ttgctgttgg 


gggataccaa 


ggcacggact 


attccgaggc 


agaagcgctg 


atcataacct 


360 


tctctctcaa 


taactacccc 


gctgatgatc 


cccgcatggc 


ccaggccaag 


ctctgggagg 


420 


aggctttctt 


gaaggaaatg 


gaatccttcc 


agaggaacac 


aagtgacaag 


ttccaggttg 


480 


cgttctcagc 


tgagcgctct 


ctggaggatg 


agatcaaccg 


caccaccatc 


caggacctgc 


540 



ctgtctttgc 


cgtcagctac 


attatcgtct 


tcctgtacat 


ctccctggcc 


ctgggcagct 


600 


actccagatg 


cagccgagta 


gcggtggagt 


ccaaggctac 


tctgggccta 


ggtggggtga 


660 


ttgttgtgct 


gggagcagtt 


ctggctgcca 


tgggcttcta 


ctcctacctg 


ggtgtcccct 


720 


cttctctggt 


tatcatccaa 


gtggtacctt 


tcctggtgct 


agctgtggga 


gctgacaaca 


780 


tcttcatctt 


tgttcttgag 


taccagaggc 


tacctaggat 


gcctggggaa 


cagcgagagg 


840 


ctcacattgg 


ccgcaccctg 


ggcagtgtgg 


cccccagcat 


gctgctgtgc 


agcctctctg 


900 


aggccatctg 


cttctttcta 


ggggccctga 


cccccatgcc 


agctgtgagg 


accttcgcct 


960 


tgacctctgg 


cttagcaatt 


atcctcgact 


tcctgctcca 


gatgactgcc 


tttgtggccc 


1020 


tgctctccct 


ggatagcaag 


aggcaggagg 


cctctcgccc 


ggatgtctta 


tgctgctttt 


1080 


caacccggaa 


gctgccccca 


cctaaagaaa 


aagaaggcct 


cttactccgc 


ttcttccgca 


1140 


agatatacgc 


tcctttcctg 


ctgcacagat 


tcatccgccc 


tgttgtgatg 


ctgctgtttc 


1200 


tgaccctgtt 


tggagcaaat 


ctctacttaa 


tgtgcaacat 


caacgtgggg 


ctagaccagg 


1260 


agctggctct 


gcccaaggac 


tcgtacttga 


tagactactt 


cctctttctg 


aaccgatacc 


1320 


ttgaagtggg 


gcctccagtg 


tactttgtca 


ccacctcggg 


cttcaacttc 


tccagcgagg 


1380 


caggcatgaa 


cgccacttgc 


tctagcgcag 


gctgtaagag 


cttctcccta 


acccagaaaa 


1440 


tccagtatgc 


cagtgaattc 


cctgaccagt 


cttacgtggc 


tattgctgca 


tcctcctggg 


1500 


tagatgactt 


catcgactgg 


ctgaccccgt 


cctcctcctg 


ctgtcgcctt 


tatatacgtg 


1560 


gcccccataa 


ggatgagttc 


tgtccctcaa 


cggatacttc 


cttcaactgc 


ttaaaaaact 


1620 


gcatgaaccg 


cactctgggt 


cctgtgaggc 


ccacagcgga 


acagtttcat 


aagtacctgc 


1680 


cctggttcct 


gaatgatccg 


cccaatatca 


gatgtcccaa 


agggggtcta 


gcagcgtata 


1740 


gaacgtctgt 


gaatttgagc 


tcagatggcc 


aggttatagc 


ctcccagttc 


atggcctacc 


1800 


acaagccctt 


aaggaactca 


caggacttca 


cagaagctct 


ccgggcgtcc 


cggttgctag 


1860 


cagccaacat 


cacagctgac 


ctacggaagg 


tgcctgggac 


agatccaaac 


tttgaggtct 


1920 


tcccttacac 


gatctccaac 


gtgttctacc 


agcaatacct 


gacggtcctt 


cctgagggaa 


1980 


tcttcaccct 


tgctctttgc 


tttgtgccca 


cctttgttgt 


ctgctacctc 


ctactgggcc 


2040 


tggacatgtg 


ctcagggatc 


ctcaacctac 


tctccatcat 


tatgattctc 


gtggacacca 


2100 


ttggcctcat 


ggctgtgtgg 


ggtatcagct 


ataatgcggt 


atccctcatc 


aaccttgtca 


2160 


cggcagtggg 


catgtctgtg 


gagtttgtgt 


cccacatcac 


tcggtccttt 


gctgtaagca 


2220 


ccaagcctac 


ccggctggag 


agggctaaag 


atgctactgt 


cttcatgggc 


agtgcggtgt 


2280 


ttgctggagt 


ggccatgacc 


aacttcccag 


gcatcctcat 


cttgggcttt 


gcccaagccc 


2340 



agcttattca 


gatcttcttc 


ttccgcctca 


accttctgat 


caccttgctg 


ggtctgctgc 

J ZJ ZJ ZJ 


2400 


atggcctggt 


cttcctgccg 


gttgtcctca 


gctatctggg 


accagatgtt 


aaccaagctc 


2460 


tggtacagga 


ggagaaacta 


gccagcgagg 


cagcagtggc 


cccagagcct 


tcttgcccac 


2520 


agtacccctc 


ccctgctgat 


gcggatgcca 


atgttaacta 


cggctttgcc 


ccagaacttg 


2580 


cccacggagc 


taatgctgct 


agaagctctt 


tgcccaaaag 


tgaccaaaag 


ttctaatgga 


2640 


gtaggagctt 


gtccatgctt 


cttgctgatg 


agggatcatg 

ZJ ZJ -J ZJ 


aaggtcttcc 


ctctggttgt 


2700 


cctcaaggcc 


tggggggagg 


ttgtttcaga 


gaaaaatggc 


tggcattcct 


gccacgagqc 


2760 


aaccggcagc 


attggcactg 


acctccttgc 


tctcataggt 


ccctaaggcc 


ttggtcagat 


2820 


tacctcctcc 


atggagagac 


tatcttaagt 


atcttaagta 


tcqtatggqa 

" ZJ ZJ ZJ 


tgcatcgcct 


2880 


gtcaattaaa 


aaggctatgg 


cctatggctc 


aggcagggcc 


atccggaaga 


agagaggatt 


2940 


ctgggataaa 


gccaggtggg 


agattcgcct 


ggggaaaatg 


tgacaatggt 


tcctgagcat 


3000 


gggcaatcag 


ccatgtggca 


gaatgtaaat 


taatataaat 


gggttgtctt 


aagttatgat 


3060 


tctagctggg 


gaggagccta 


gctgtgtagc 


caagatattt 


gtaaatataa 


aaaaaaaaaa 


3120 


aaaa 












3124 



<210> 9 

<211> 4484 

<212> DNA 

<213> Rattus sp. 



<400> 9 

atggcagctg cctggctggg atggctgctc tgggccctgc tcctgagcgc ggcccagggt 60 

gagctataca cacccaaaca cgaagctggg gtctgcacct tttacgaaga gtgcgggaaa 120 

aacccagagc tctctggagg cctcacgtca ctatccaatg tatcctgcct gtctaacacc 180 

ccggcccgcc acgtcacggg tgaacacctg gctcttctcc agcgcatctg tccccgcctg 240 

tacaacggcc ccaataccac ttttgcctgt tgctctacca agcagctgct gtccttagaa 300 

agcagcatgt ccatcaccaa ggcccttctc acgcgctgcc cggcctgctc tgacaatttt 360 

gtgagcttac actgccacaa cacttgcagc cctgaccaga gcctcttcat caacgtcacc 420 

cgggtggttg agcggggcgc tggagagcct cctgccgtgg tggcctatga ggccttttat 480 

cagcgcagct ttgctgagaa ggcctatgag tcctgcagcc aggtgcgcat ccctgcggcc 540 



gcttccttgg 


ccgtgggcag 


catgtgtgga 


gtgtatggct 


ccgccctctg 


caatgctcag 


600 


cgctggctca 


acttccaagg 


agacacaggg 


aatggcctgg 


ctccgctgga 


tatcaccttc 


660 


cacctcttgg 


agcctggcca 


ggccctaccg 


gatgggatcc 


agccactgaa 


tgggaagatc 


720 


gcaccctgca 


acgagtctca 


gggtgatgac 


tcagcagtct 


gctcctgcca 


ggactgtgcg 


780 


gcgtcctgcc 


ctgtcatccc 


tccgcccgag 


gccttgcgcc 


cttccttcta 


catgggtcgc 


840 


atgccaggct 


ggctggccct 


catcatcatc 


ttcactgctg 


tctttgtgtt 


gctctctgca 


900 


gtccttgtgc 


gtctccgagt 


ggtttccaac 


aggaacaaga 


acaaggcaga 


aggcccccag 


960 


gaagccccca 


aactccctca 


taagcacaaa 


ctctcacccc 


ataccatcct 


gggccggttc 


1020 


ttccagaact 


ggggcacaag 


ggtggcctcg 


tggccactca 


ccgtcttagc 


actgtccttc 


1080 


atcgttgtga 


tagccttagc 


agcaggcctg 


acctttattg 


aactcaccac 


agaccctgtg 


1140 


gaactgtggt 


cggcccccaa 


gagccaggcc 


cggaaagaga 


agtctttcca 


tgatgagcat 


1200 


ttcggcccct 


tctttcgaac 


caaccagatt 


ttcgtgacag 


ctcggaacag 


gtccagctac 


1260 


aagtacgact 


ccctactgct 


agggtccaag 


aacttcagtg 


ggatcctgtc 


cctggacttc 


1320 


ctgctggagc 


tgctggagct 


tcaggagagg 


cttcgacacc 


tgcaagtgtg 


gtcccctgag 


1380 


gcagagcgca 


acatctccct 


ccaggacatc 


tgctatgccc 


ccctcaaccc 


atataacacc 


1440 


agcctctccg 


actgctgtgt 


caacagcctc 


cttcagtact 


tccagaacaa 


ccgcaccctc 


1500 


ctgatgctca 


cggccaacca 


gactctgaat 


ggccagacct 


ccctggtgga 


ctggaaggac 


1560 


catttcctct 


actgtgcaaa 


tgcccctctc 


acgttcaaag 


atggcacgtc 


tctggccctg 


1620 


agctgcatgg 


ctgactacgg 


ggctcctgtc 


ttccccttcc 


ttgctgttgg 


gggataccaa 


1680 


ggcacggact 


attccgaggc 


agaagcgctg 


atcataacct 


tctctctcaa 


taactacccc 


1740 


gctgatgatc 


cccgcatggc 


ccaggccaag 


ctctgggagg 


aggctttctt 


gaaggaaatg 


1800 


gaatccttcc 


agaggaacac 


aagtgacaag 


ttccaggttg 


cgttctcagc 


tgagcgctct 


1860 


ctggaggatg 


agatcaaccg 


caccaccatc 


caggacctgc 


ctgtctttgc 


cgtcagctac 


1920 


attatcgtct 


tcctgtacat 


ctccctggcc 


ctgggcagct 


actccagatg 


cagccgagta 


1980 


gcggtggagt 


ccaaggctac 


tctgggccta 


ggtggggtga 


ttgttgtgct 


gggagcagtt 


2040 


ctggctgcca 


tgggcttcta 


ctcctacctg 


ggtgtcccct 


cttctctggt 


tatcatccaa 


2100 


gtggtacctt 


tcctggtgct 


agctgtggga 


gctgacaaca 


tcttcatctt 


tgttcttgag 


2160 


taccagaggc 


tacctaggat 


gcctggggaa 


cagcgagagg 


ctcacattgg 


ccgcaccctg 


2220 


ggcagtgtgg 


cccccagcat 


gctgctgtgc 


agcctctctg 


aggccatctg 


cttctttcta 


2280 


ggggccctga 


cccccatgcc 


agctgtgagg 


accttcgcct 


tgacctctgg 


cttagcaatt 


2340 



atcctcgact 


tcctgctcca 


gatgactgcc 


tttgtggccc 


tgctctccct 


ggatagcaag 


2400 


aggcaggagg 


cctctcgccc 


ggatgtctta 


tgctgctttt 


caacccggaa 


gctgccccca 


2460 


cctaaagaaa 


aagaaggcct 


cttactccgc 


ttcttccgca 


agatatacgc 


tcctttcctg 


2520 


ctgcacagat 


tcatccgccc 


tgttgtgatg 


ctgctgtttc 


tgaccctgtt 


tggagcaaat 


2580 


ctctacttaa 


tgtgcaacat 


caacgtgggg 


ctagaccagg 


agctggctct 


gcccaaggac 


2640 


tcgtacttga 


tagactactt 


cctctttctg 


aaccgatacc 


ttgaagtggg 


gcctccagtg 


2700 


tactttgtca 


ccacctcggg 


cttcaacttc 


tccagcgagg 


caggcatgaa 


cgccacttgc 


2760 


tctagcgcag 


gctgtaagag 


cttctcccta 


acccagaaaa 


tccagtatgc 


cagtgaattc 


2820 


cctgaccagt 


cttacgtggc 


tattgctgca 


tcctcctggg 


tagatgactt 


catcgactgg 


2880 


ctgaccccgt 


cctcctcctg 


ctgtcgcctt 


tatatacgtg 


gcccccataa 


ggatgagttc 


2940 


tgtccctcaa 


cggatacttc 


cttcaactgc 


ttaaaaaact 


gcatgaaccg 


cactctgggt 


3000 


cctgtgaggc 


ccacagcgga 


acagtttcat 


aagtacctgc 


cctggttcct 


gaatgatccg 


3060 


cccaatatca 


gatgtcccaa 


agggggtcta 


gcagcgtata 


gaacgtctgt 


gaatttgagc 


3120 


tcagatggcc 


aggttatagc 


ctcccagttc 


atggcctacc 


acaagccctt 


aaggaactca 


3180 


caggacttca 


cagaagctct 


ccgggcgtcc 


cggttgctag 


cagccaacat 


cacagctgac 


3240 


ctacggaagg 


tgcctgggac 


agatccaaac 


tttgaggtct 


tcccttacac 


gatctccaac 


3300 


gtgttctacc 


agcaatacct 


gacggtcctt 


cctgagggaa 


tcttcaccct 


tgctctttgc 


3360 


tttgtgccca 


cctttgttgt 


ctgctacctc 


ctactgggcc 


tggacatgtg 


ctcagggatc 


3420 


ctcaacctac 


tctccatcat 


tatgattctc 


gtggacacca 


ttggcctcat 


qqctqtqtqq 


3480 


ggtatcagct 


ataatgcggt 


atccctcatc 


aaccttgtca 


cggcagtggg 


catgtctgtg 


3540 


gagtttgtgt 


cccacatcac 


tcggtccttt 


gctgtaagca 


ccaagcctac 


ccggctggag 


3600 


agggctaaag 


atgctactgt 


cttcatgggc 


agtgcggtgt 


ttgctqqaqt 


ggccatgacc 


3660 


aacttcccag 


gcatcctcat 


cttgggcttt 


gcccaagccc 


agcttattca 


gatcttcttc 


3720 


ttccgcctca 


accttctgat 


caccttgctg 


qqtctqctqc 


atqqcctqqt 


cttcctgccg 


3780 


gt tgtcctca 


gctatctggg 


accagatgtt 


aaccaagctc 


tggtacagga 


ggagaaacta 


3840 


gccagcgagg 


cagcagtggc 


cccagagcct 


tcttgcccac 


agtacccctc 


ccctgctgat 


3900 


gcggatgcca 


atgttaacta 


cggctttgcc 


ccagaacttg 


cccacggagc 


taatgctgct 


3960 


agaagctctt 


tgcccaaaag 


tgaccaaaag 


ttctaatgga 


gtaggagctt 


gtccatgctt 


4020 


cttgctgatg 


agggatcatg 


aaggtcttcc 


ctctggttgt 


cctcaaggcc 


tggggggagg 


4080 



ttgtttcaga 


gaaaaatggc 


tggcattcct 


gccacgaggc 


aaccggcagc 


attggcactg 


4140 


acctccttgc 


tctcataggt 


ccctaaggcc 


ttggtcagat 


tacctcctcc 


atggagagac 


4200 


tatcttaagt 


atcttaagta 


tcgtatggga 


tgcatcgcct 


gtcaattaaa 


aaggctatgg 


4260 


cctatggctc 


aggcagggcc 


atccggaaga 


agagaggatt 


ctgggataaa 


gccaggtggg 


4320 


agattcgcct 


ggggaaaatg 


tgacaatggt 


tcctgagcat 


gggcaatcag 


ccatgtggca 


4380 


gaatgtaaat 


taatataaat 


gggttgtctt 


aagttatgat 


tctagctggg 


gaggagccta 


4440 


gctgtgtagc 


caagatattt 


gtaaatataa 


aaaaaaaaaa 


aaaa 
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<210> 10 

<211> 3993 

<212> DNA 

<213> Rattus sp. 

<220> 

<221> misc_f eature 

<222> (1)..(3993) 

<223> n is g or a or t or c 

<400> 10 



atggcngcng 


cntggytngg 


ntggytnytn 


tgggcnytny 


tnytnwsngc 


ngcncarggn 


60 


garytntaya 


cnccnaarca 


ygargcnggn 


gtntgyacnt 


tytaygarga 


rtgyggnaar 


120 


aayccngary 


tnwsnggngg 


nytnacnwsn 


ytnwsnaayg 


tnwsntgyyt 


nwsnaayacn 


180 


ccngcnmgnc 


aygtnacngg 


ngarcayytn 


gcnytnytnc 


armgnathtg 


yccnmgnytn 


240 


tayaayggnc 


cnaayacnac 


nttygcntgy 


tgywsnacna 


arcarytnyt 


nwsnytngar 


300 


wsnwsnatgw 


snathacnaa 


rgcnytnytn 


acnmgntgyc 


cngcntgyws 


ngayaaytty 


360 


gtnwsnytnc 


aytgycayaa 


yacntgywsn 


ccngaycarw 


snytnttyat 


haaygtnacn 


420 


mgngtngtng 


armgnggngc 


nggngarccn 


ccngcngtng 


tngcntayga 


rgcnttytay 


480 


carmgnwsnt 


tygcngaraa 


rgcntaygar 


wsntgywsnc 


argtnmgnat 


hccngcngcn 


540 


gcnwsnytng 


cngtnggnws 


natgtgyggn 


gtntayggnw 


sngcnytntg 


yaaygcncar 


600 


mgntggytna 


ayttycargg 


ngayacnggn 


aayggnytng 


cnccnytnga 


yathacntty 


660 


cayytnytng 


arccnggnca 


rgcnytnccn 


gayggnathc 


arccnytnaa 


yggnaarath 


720 


gcnccntgya 


aygarwsnca 


rggngaygay 


wsngcngtnt 


gywsntgyca 


rgaytgygcn 


780 


gcnwsntgyc 


cngtnathcc 


nccnccngar 


gcnytnmgnc 


cnwsnttyta 


yatgggnmgn 


840 



atgccnggnt 


ggytngcnyt 


nathathath 


ttyacngcng 


tnttygtnyt 


nytnwsngcn 


900 


gtnytngtnm 


gnytnmgngt 


ngtnwsnaay 


mgnaayaara 


ayaargcnga 


rggnccncar 


960 


gargcnccna 


arytnccnca 


yaarcayaar 


ytnwsnccnc 


ayacnathyt 


nggnmgntty 


1020 


ttycaraayt 


ggggnacnmg 


ngtngcnwsn 


tggccnytna 


cngtnytngc 


nytnwsntty 


1080 


athgtngtna 


thgcnytngc 


ngcnggnytn 


acnttyathg 


arytnacnac 


ngayccngtn 


1140 


garytntggw 


sngcnccnaa 


rwsncargcn 


mgnaargara 


arwsnttyca 


yqayqarcay 


1200 


ttyggnccnt 


tyttymgnac 


naaycarath 


ttygtnacng 


cnmgnaaymg 


nwsnwsntay 


1260 


aartaygayw 


snytnytnyt 


nggnwsnaar 


aayttywsng 


gnathytnws 


nytngaytty 


1320 


ytnytngary 


tnytngaryt 


ncargarmgn 


ytnmgncayy 


tncargtntg 


gwsnccngar 


1380 


gcngarmgna 


ayathwsnyt 


ncargayath 


tgytaygcnc 

ZJ J J ^ 


cnytnaaycc 


ntayaayacn 


1440 


wsnytnwsng 


aytgytgygt 


naaywsnytn 


ytncartayt 


tycaraayaa 


ymgnacnytn 


1500 


ytnatgytna 


cngcnaayca 


racnytnaay 


ggncaracnw 


snytngtnga 


ytggaargay 


1560 


cayttyytnt 


aytgygcnaa 


ygcnccnytn 


acnttyaarg 


ayggnacnws 


nvtnacnvtn 


1620 


wsntgyatgg 


cnaavtavQQ 


ngcnccngtn 


ttyccnttyy 


tngcngtngg 


naantavra t 

** * Y -L. 


1680 


ggnacngayt 


aywsngargc 


ngargcnytn 


athathacnt 


tywsnytnaa 


yaaytayccn 


1740 


gcngaygayc 


cnmgnatggc 


ncargcnaar 


ytntqqqarq 


argcnttyyt 


naargaratg 


1800 


garwsnttyc 


armgnaayac 


nwsngayaar 


ttycargtng 


cnttywsngc 


ngarmgnwsn 


1860 


ytngargayg 


arathaaymg 


nacnacnath 


cargayytnc 


cngtnttygc 


ngtnwsntay 


1920 


athathgtnt 


tyytntayat 


hwsnytngcn 


ytnggnwsnt 


aywsnmgntg 


ywsnmgngtn 


1980 


gcngtngarw 


snaargcnac 


nytnggnytn 


ggnggngtna 


thgtngtnyt 


nggngcngtn 


2040 


ytngcngcna 


tgggnttyta 


ywsntayytn 


ggngtnccnw 


snwsnytngt 


nathathcar 


2100 


gtngtnccnt 


tyytngtnyt 


ngcngtnqqn 


qcnqavaava 


thttyathtt 


ygtnytngar 


2160 


taycarmgny 


tnccnmgnat 


gccnggngar 


carmgngarg 


cncayathgg 


nmcinflrnvtn 
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2220 


ggnwsngtng 


cnccnwsnat 


gytnytntgy 


wsnytnwsng 








ggngcnytna 


cnccnatgcc 


ngcngtnmgn 


acnttygcny 


tnacnwsngg 


nytngcnath 


2340 


athytngayt 


tyytnytnca 


ratgacngcn 


ttygtngcny 


tnytnwsnyt 


ngaywsnaar 


2400 


mgncargarg 


cnwsnmgncc 


ngaygtnytn 


tgytgyttyw 


snacnmgnaa 


rytnccnccn 


2460 


ccnaargara 


argarggnyt 


nytnytnmgn 


ttyttymgna 


arathtaygc 


nccnttyytn 


2520 


ytncaymgnt 


tyathmgncc 


ngtngtnatg 


ytnytnttyy 


tnacnytntt 


yggngcnaay 


2580 



ytntayytna 


tgtgyaayat 


haaygtnggn 


ytngaycarg 


arytngcnyt 


nccnaargay 


2640 


wsntayytna 


thgaytaytt 


yytnttyytn 


aaymgntayy 


tngargtngg 


nccnccngtn 


2700 


tayttygtna 


cnacnwsngg 


nttyaaytty 


wsnwsngarg 


cnggnatgaa 


ygcnacntgy 


2760 


wsnwsngcng 


gntgyaarws 


nttywsnytn 


acncaraara 


thcartaygc 


nwsngartty 


2820 


ccngaycarw 


sntaygtngc 


nathgcngcn 


wsnwsntggg 


tngaygaytt 


yathgaytgg 


2880 


ytnacnccnw 


snwsnwsntg 


ytgymgnytn 


tayathmgng 


gnccncayaa 


rgaygartty 


2940 


tgyccnwsna 


cngayacnws 


nttyaaytgy 


ytnaaraayt 


gyatgaaymg 


nacnytnggn 


3000 


ccngtnmgnc 


cnacngcnga 


rcarttycay 


aartayytnc 


cntggttyyt 


naaygayccn 


3060 


ccnaayathm 


gntgyccnaa 


rggnggnytn 


gcngcntaym 


gnacnwsngt 


naayytnwsn 


3120 


wsngayggnc 


argtnathgc 


nwsncartty 


atggcntayc 


ayaarccnyt 


nmgnaaywsn 


3180 


cargayttya 


cngargcnyt 


nmgngcnwsn 


mgnytnytng 


cngcnaayat 


hacngcngay 


3240 


ytnmgnaarg 


tnccnggnac 


ngayccnaay 


ttygargtnt 


tyccntayac 


nathwsnaay 


3300 


gtnttytayc 


arcartayyt 


nacngtnytn 


ccngarggna 


thttyacnyt 


ngcnytntgy 


3360 


ttygtnccna 


cnttygtngt 


ntgytayytn 


ytnytnggny 


tngayatgtg 


ywsnggnath 


3420 


ytnaayytny 


tnwsnathat 


hatgathytn 


gtngayacna 


thggnytnat 


ggcngtntgg 


3480 


ggnathwsnt 


ayaaygcngt 


nwsnytnath 


aayytngtna 


cngcngtngg 


natgwsngtn 


3540 


garttygtnw 


sncayathac 


nmgnwsntty 


gcngtnwsna 


cnaarccnac 


nmgnytngar 


3600 


mgngcnaarg 


aygcnacngt 


nttyatgggn 


wsngcngtnt 


tygcnggngt 


ngcnatgacn 


3660 


aayttyccng 


gnathytnat 


hytnggntty 


gcncargcnc 


arytnathca 


rathttytty 


3720 


ttymgnytna 


ayytnytnat 


hacnytnytn 


ggnytnytnc 


ayggnytngt 


nttyytnccn 


3780 


gtngtnytnw 


sntayytngg 


nccngaygtn 


aaycargcny 


tngtncarga 


rgaraarytn 


3840 


gcnwsngarg 


cngcngtngc 


nccngarccn 


wsntgyccnc 


artayccnws 


nccngcngay 


3900 


gcngaygcna 


aygtnaayta 


yggnttygcn 


ccngarytng 


cncayggngc 


naaygcngcn 


3960 


mgnwsnwsny 


tnccnaarws 


ngaycaraar 


tty 
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<210> 11 

<211> 4002 

<212> DNA 

<213> Mus sp. 



<220> 

<221> CDS 

<222> (1)..(4002) 

<223> 



<400> 11 

atg gca get gec tgg cag gga tgg ctg etc tgg gee ctg etc ctg aat 48 
Met Ala Ala Ala Trp Gin Gly Trp Leu Leu Trp Ala Leu Leu Leu Asn 
15 10 15 

teg gee cag ggt gag etc tac aca ccc act cac aaa get ggc ttc tgc 96 
Ser Ala Gin Gly Glu Leu Tyr Thr Pro Thr His Lys Ala Gly Phe Cys 
20 25 30 

acc ttt tat gaa gag tgt ggg aag aac cca gag ctt tct gga ggc etc 14 4 

Thr Phe Tyr Glu Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Gly Leu 
35 40 45 

aca tea eta tec aat ate tec tgc ttg tct aat acc cca gee cgc cat 192 
Thr Ser Leu Ser Asn lie Ser Cys Leu Ser Asn Thr Pro Ala Arg His 
50 55 60 

gtc aca ggt gac cac ctg get ctt etc cag cgc gtc tgt ccc cgc eta 240 
Val Thr Gly Asp His Leu Ala Leu Leu Gin Arg Val Cys Pro Arg Leu 
65 70 75 80 

tac aat ggc ccc aat gac acc tat gee tgt tgc tct acc aag cag ctg 288 
Tyr Asn Gly Pro Asn Asp Thr Tyr Ala Cys Cys Ser Thr Lys Gin Leu 
85 90 95 

gtg tea tta gac agt age ctg tct ate acc aag gee etc ctt aca cgc 336 
Val Ser Leu Asp Ser Ser Leu Ser lie Thr Lys Ala Leu Leu Thr Arg 
100 105 ' 110 

tgc ccg gca tgc tct gaa aat ttt gtg age ata cac tgt cat aat acc 384 
Cys Pro Ala Cys Ser Glu Asn Phe Val Ser He His Cys His Asn Thr 
115 120 125 

tgc age cct gac cag age etc ttc ate aat gtt act cgc gtg gtt cag 432 
Cys Ser Pro Asp Gin Ser Leu Phe He Asn Val Thr Arg Val Val Gin 
130 135 140 

egg gac cct gga cag ctt cct get gtg gtg gee tat gag gee ttt tat 480 
Arg Asp Pro Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 

caa cgc agt ttt gca gag aag gee tat gag tec tgt age egg gtg cgc 528 
Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys Ser Arg Val Arg 
165 170 175 

ate cct gca get gee teg ctg get gtg ggc age atg tgt gga gtg tat 576 
He Pro Ala Ala Ala Ser Leu Ala Val Gly Ser Met Cys Gly Val Tyr 
180 185 190 



ggc tct gcc 
Gly Ser Ala 
195 

aca ggg aat 
Thr Gly Asn 
210 

cct ggc cag 
Pro Gly Gin 
225 

aca ccc tgc 
Thr Pro Cys 



cag gac tgt 
Gin Asp Cys 



cgc cct tct 
Arg Pro Ser 
275 

ate ate ttc 
He He Phe 
290 

etc cga gtg 
Leu Arg Val 
305 

gaa gcc ccc 
Glu Ala Pro 



ctt ggc egg 
Leu Gly Arg 



etc act gtc 
Leu Thr Val 
355 

ggc ctg acc 
Gly Leu Thr 
370 

gcc cct aaa 
Ala Pro Lys 
385 

ttt ggc ccc 
Phe Gly Pro 



agg tec age 
Arg Ser Ser 



etc tgc aat 
Leu Cys Asn 



ggc ctg get 
Gly Leu Ala 



gcc ctg gca 
Ala Leu Ala 
230 

aat gag tec 
Asn Glu Ser 
245 

gca gca tec 
Ala Ala Ser 
260 

ttc tac atg 
Phe Tyr Met 



act get gtc 
Thr Ala Val 



get tec aac 
Ala Ser Asn 
310 

aac etc cct 
Asn Leu Pro 
325 

ttc ttc gag 
Phe Phe Glu 
340 

ttg gca ctg 
Leu Ala Leu 



ttt ata gaa 
Phe He Glu 



age caa gcc 
Ser Gin Ala 
390 

ttc ttc cga 
Phe Phe Arg 
405 

tac aag tac 
Tyr Lys Tyr 
420 



get cag cgc 
Ala Gin Arg 
200 

ccg ctg gac 
Pro Leu Asp 
215 

gat ggg atg 
Asp Gly Met 



cag ggt gaa 
Gin Gly Glu 



tgc cct gtc 
Cys Pro Val 
265 

ggt cga atg 
Gly Arg Met 
280 

ttt gta ttg 
Phe Val Leu 
295 

agg aac aag 
Arg Asn Lys 



cgt aag cgc 
Arg Lys Arg 



age tgg gga 
Ser Trp Gly 
345 

tec ttc ata 
Ser Phe He 
360 

etc acc aca 
Leu Thr Thr 
375 

egg aaa gaa 
Arg Lys Glu 



acc aac cag 
Thr Asn Gin 



gac tec ctg 
Asp Ser Leu 
425 



tgg etc aac 
Trp Leu Asn 



ate acc ttc 
He Thr Phe 
220 

aag cca ctg 
Lys Pro Leu 
235 

gac teg gca 
Asp Ser Ala 
250 

ate cct ccg 
He Pro Pro 



cca ggc tgg 
Pro Gly Trp 



etc tct gtt 
Leu Ser Val 
300 

aac aag aca 
Asn Lys Thr 
315 

aga ttc tea 
Arg Phe Ser 
330 

aca agg gtg 
Thr Arg Val 



gtt gtg ata 
Val Val He 



gac cct gtg 
Asp Pro Val 
380 

aag get ttc 
Lys Ala Phe 
395 

att ttt gtg 
He Phe Val 
410 

ctg eta ggg 
Leu Leu Gly 



ttc caa gga 
Phe Gin Gly 
205 

cac etc ttg 
His Leu Leu 



gat ggg aag 
Asp Gly Lys 



gcc tgt tec 
Ala Cys Ser 
255 

ccc ccg gcc 
Pro Pro Ala 
270 

ctg get etc 
Leu Ala Leu 
285 

gtc ctt gtg 
Val Leu Val 



gca ggc tec 
Ala Gly Ser 



cct cac act 
Pro His Thr 
335 

gcc tea tgg 
Ala Ser Trp 
350 

gcc ttg tea 
Ala Leu Ser 
365 

gaa ctg tgg 
Glu Leu Trp 



cat gac gag 
His Asp Glu 



aca get aag 
Thr Ala Lys 
415 

ccc aag aac 
Pro Lys Asn 
430 



gac 624 
Asp 



gag 672 
Glu 



ate 720 

He 

240 

tgc 768 
Cys 



ctg 816 
Leu 



ate 864 
He 



tat 912 
Tyr 



cag 960 

Gin 

320 

gtc 1008 
Val 



cca 1056 
Pro 



gta 1104 
Val 



teg 1152 
Ser 



cat 1200 

His 

400 

aac 1248 
Asn 



ttc 1296 
Phe 



agt ggg ate eta tec ctg gac ttg ctg cag gag ctg ttg gag eta cag 1344 
Ser Gly lie Leu Ser Leu Asp Leu Leu Gin Glu Leu Leu Glu Leu Gin 
435 440 445 

gag aga ctt cga cac ctg caa gtg tgg tec cat gag gca cag cgc aac 1392 
Glu Arg Leu Arg His Leu Gin Val Trp Ser His Glu Ala Gin Arg Asn 
450 455 460 

ate tec etc cag gac ate tgc tat get ccc etc aac ccg cat aac ace 1440 
lie Ser Leu Gin Asp lie Cys Tyr Ala Pro Leu Asn Pro His Asn Thr 
465 470 475 480 

age etc act gac tgc tgt gtc aac age etc ctt caa tac ttc cag aac 1488 
Ser Leu Thr Asp Cys Cys Val Asn Ser Leu Leu Gin Tyr Phe Gin Asn 
485 490 495 

aac cac aca etc ctg ctg etc aca gec aat cag act ctg aat ggc cag 1536 
Asn His Thr Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Asn Gly Gin 
500 505 510 

acc tec ctg gtg gac tgg aag gac cat ttc etc tac tgt gee aat gec 1584 
Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 525 

cct etc acg tac aaa gat ggc aca gee ctg gec ctg age tgc ata get 1632 
Pro Leu Thr Tyr Lys Asp Gly Thr Ala Leu Ala Leu Ser Cys He Ala 
530 535 540 

gac tac ggg gca cct gtc ttc ccc ttc ctt get gtt ggg ggc tac caa 1680 
Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala Val Gly Gly Tyr Gin 
545 550 555 560 

ggg acg gac tac teg gag gca gaa gee ctg ate ata acc ttc tct ate 1728 
Gly Thr Asp Tyr Ser Glu Ala Glu Ala Leu He He Thr Phe Ser He 
565 570 575 

aat aac tac ccc get gat gat ccc cgc atg gee cac gee aag etc tgg 1776 
Asn Asn Tyr Pro Ala Asp Asp Pro Arg Met Ala His Ala Lys Leu Trp 
580 585 590 

gag gag get ttc ttg aag gaa atg caa tec ttc cag aga age aca get 1824 
Glu Glu Ala Phe Leu Lys Glu Met Gin Ser Phe Gin Arg Ser Thr Ala 
595 600 605 

gac aag ttc cag att gcg ttc tea get gag cgt tct ctg gag gac gag 1872 
Asp Lys Phe Gin He Ala Phe Ser Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 

ate aat cgc act acc ate cag gac ctg cct gtc ttt gee ate age tac 1920 
He Asn Arg Thr Thr He Gin Asp Leu Pro Val Phe Ala He Ser Tyr 
625 630 635 640 

ctt ate gtc ttc ctg tac ate tec ctg gee ctg ggc age tac tec aga 1968 
Leu He Val Phe Leu Tyr lie Ser Leu Ala Leu Gly Ser Tyr Ser Arg 
645 650 655 

tgg age cga gtt gcg gtg gat tec aag get act ctg ggc eta ggt ggg 2016 
Trp Ser Arg Val Ala Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly 



660 



665 



670 



gtg get gtt 
Val Ala Val 
675 

tac ctg ggt 
Tyr Leu Gly 
690 

ctg gtg ctg 
Leu Val Leu 
705 

tac cag agg 
Tyr Gin Arg 



ggc cgc acc 
Gly Arg Thr 



tct gag gec 
Ser Glu Ala 
755 

gtg agg acc 
Val Arg Thr 
770 

ctg etc cag 
Leu Leu Gin 
785 

agg cag gag 
Arg Gin Glu 



aat ctg ccc 
Asn Leu Pro 



cgc aag ata 
Arg Lys lie 
835 

gtg ctg ctg 
Val Leu Leu 
850 

tgc aac ate 
Cys Asn lie 
865 

tec tac ctg 
Ser Tyr Leu 



ggg cct cca 



gtg ctg gga 
Val Leu Gly 



gtc ccc tec 
Val Pro Ser 



get gtg gga 
Ala Val Gly 
710 

ctg cct agg 
Leu Pro Arg 
725 

ctg ggt agt 
Leu Gly Ser 
740 

ate tgc ttc 
lie Cys Phe 



ttt gee ttg 
Phe Ala Leu 



atg aca gec 
Met Thr Ala 
790 

gec tct cgc 
Ala Ser Arg 
805 

cca ccg aaa 
Pro Pro Lys 
820 

tac act ccc 
Tyr Thr Pro 



etc ttt ctg 
Leu Phe Leu 



age gtg ggg 
Ser Val Gly 
870 

ata gac tac 
lie Asp Tyr 
885 

gtg tac ttt 



gca gtc gtc 
Ala Val Val 
680 

tct ctg gtc 
Ser Leu Val 
695 

get gac aac 
Ala Asp Asn 



atg ccc ggg 
Met Pro Gly 



gtg gee ccc 
Val Ala Pro 
745 

ttt eta ggg 
Phe Leu Gly 
760 

acc tct ggc 
Thr Ser Gly 
775 

ttt gtg gee 
Phe Val Ala 



ccc gac gtc 
Pro Asp Val 



caa aaa gaa 
Gin Lys Glu 
825 

ttc ctg ctg 
Phe Leu Leu 
840 

gtc ctg ttt 
Val Leu Phe 
855 

ctg gac cag 
Leu Asp Gin 



ttc etc ttt 
Phe Leu Phe 



gac acc acc 



get gec atg 
Ala Ala Met 



ate att caa 
He He Gin 
700 

ate ttc ate 
He Phe He 
715 

gag cag cga 
Glu Gin Arg 
730 

age atg ctg 
Ser Met Leu 



gec ctg acc 
Ala Leu Thr 



tta gca ate 
Leu Ala He 
780 

ctg etc tec 
Leu Leu Ser 
795 

gtg tgc tgc 
Val Cys Cys 
810 

ggc etc tta 
Gly Leu Leu 



cac aga ttc 
His Arg Phe 



gga gca aac 
Gly Ala Asn 
860 

gat ctg get 
Asp Leu Ala 
875 

ctg aac egg 
Leu Asn Arg 
890 

tea ggc tac 



ggc ttc tac 
Gly Phe Tyr 
685 

gtg gta cct 
Val Val Pro 



ttt gtt ctt 
Phe Val Leu 



gag get cac 
Glu Ala His 
735 

ctg tgc age 
Leu Cys Ser 
750 

tec atg cca 
Ser Met Pro 
765 

ate ttt gac 
He Phe Asp 



ctg gat age 
Leu Asp Ser 



ttt tea age 
Phe Ser Ser 
815 

ctt tgc ttc 
Leu Cys Phe 
830 

ate cgc cct 
He Arg Pro 
845 

etc tac tta 
Leu Tyr Leu 



ctg ccc aag 
Leu Pro Lys 



tac ttg gaa 
Tyr Leu Glu 
895 

aac ttt tec 



tec 2064 
Ser 



ttc 2112 
Phe 



gag 2160 

Glu 

720 

att 2208 
He 



etc 2256 
Leu 



get 2304 
Ala 



ttc 2352 
Phe 



aag 2400 

Lys 

800 

cga 2448 
Arg 



ttc 2496 
Phe 



gtt 2544 
Val 



atg 2592 
Met 



gat 2640 

Asp 

880 

gtg 2688 
Val 



acc 2736 



Gly Pro Pro Val Tyr Phe Asp Thr Thr Ser Gly Tyr Asn Phe Ser Thr 
900 ' 905 " 910 



gag gca ggc atg aac gcc att tgc tct agt gca ggc tgt gag age ttc 
Glu Ala Gly Met Asn Ala lie Cys Ser Ser Ala Gly Cys Glu Ser Phe 
915 920 925 

tec eta ace cag aaa ate cag tat gcc agt gaa ttc cct aat cag tct 
Ser Leu Thr Gin Lys He Gin Tyr Ala Ser Glu Phe Pro Asn Gin Ser 
930 935 940 

tat gtg get att get gca tec tec tgg gta gat gac ttc ate gac tgg 
Tyr Val Ala He Ala Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
945 950 955 960 

ctg acc cca tec tec tec tgc tgc cgc att tat acc cgt ggc ccc cat 
Leu Thr Pro Ser Ser Ser Cys Cys Arg He Tyr Thr Arg Gly Pro His 
965 970 " 975 

aaa gat gag ttc tgt ccc tea acg gat act tec ttc aac tgt etc aaa 
Lys Asp Glu Phe Cys Pro Ser Thr Asp Thr Ser Phe Asn Cys Leu Lys 
980 985 990 

aac tgc atg aac cgc act ctg ggt ccc gtg aga ccc aca aca gaa ca 
Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Thr Glu Gl 
995 1000 ' 1005 

ttt cat aag tac ctg ccc tgg ttc ctg aat gat acg ccc aac ate 
Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Thr Pro Asn He 
1010 1015 1020 

aga tgt cct aaa ggg ggc eta gca gcg tat aga acc tct gtg aat 
Arg Cys Pro Lys Gly Gly Leu Ala Ala Tyr Arg Thr Ser Val Asn 
1025 1030 1035 

ttg age tea gat ggc cag att ata gcc tec cag ttc atg gcc tac 
Leu Ser Ser Asp Gly Gin He He Ala Ser Gin Phe Met Ala Tyr 
1040 1045 1050 

cac aag ccc tta egg aac tea cag gac ttt aca gaa get etc egg 
His Lys Pro Leu Arg Asn Ser Gin Asp Phe Thr Glu Ala Leu Arg 
1055 1060 1065 

gca tec egg ttg eta gca gcc aac ate aca get gaa eta egg aag 
Ala Ser Arg Leu Leu Ala Ala Asn He Thr Ala Glu Leu Arg Lys 
1070 1075 1080 

gtg cct ggg aca gat ccc aac ttt gag gtc ttc cct tac acg ate 
Val Pro Gly Thr Asp Pro Asn Phe Glu Val Phe Pro Tyr Thr He 
1085 1090 1095 

tec aat gtg ttc tac cag caa tac ctg acg gtt etc cct gag gga 
Ser Asn Val Phe Tyr Gin Gin Tyr Leu Thr Val Leu Pro Glu Gly 
1100 1105 1110 

ate ttc act ctt get etc tgc ttc gtg ccc acc ttt gtg gtc tgc 
He Phe Thr Leu Ala Leu Cys Phe Val Pro Thr Phe Val Val Cys 
1115 1120 1125 



tac etc eta ctg ggc ctg gac ata cgc tea ggc ate etc aac ctg 
Tyr Leu Leu Leu Gly Leu Asp He Arg Ser Gly He Leu Asn Leu 
1130 1135 ' 1140 



3429 



etc tec ate att atg ate etc gtg gac ace ate ggc etc atg get 
Leu Ser He He Met He Leu Val Asp Thr He Gly Leu Met Ala 
1145 1150 1155 



3474 



gtg tgg ggt ate age tac aat get gtg tec etc ate aac ctt gtc 
Val Trp Gly He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val 
1160 1165 1170 



3519 



acg gca gtg ggc atg tct gtg gag ttc gtg tec cac att acc egg 
Thr Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg 
1175 1180 1185 



3564 



tec ttt get gta age acc aag cct acc egg ctg gag aga gee aaa 
Ser Phe Ala Val Ser Thr Lys Pro Thr Arg Leu Glu Arg Ala Lys 
1190 1195 1200 



3609 



gat get act ate ttc atg ggc 
Asp Ala Thr He Phe Met Gly 
1205 1210 



agt gcg gtg ttt get 
Ser Ala Val Phe Ala 
1215 



gga gtg gec 
Gly Val Ala 



3654 



atg acc aac ttc ccg ggc ate etc ate ctg ggc ttt get cag gee 
Met Thr Asn Phe Pro Gly He Leu He Leu Gly Phe Ala Gin Ala 
1220 1225 1230 ' 



3699 



cag ctt ate cag att ttc ttc ttc cgc etc aac etc ctg ate acc 
Gin Leu He Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr 
1235 1240 ' 1245 



3744 



ttg ctg ggt ctg eta cac ggc ctg gtc ttc ctg ccc gtt gtc etc 
Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 



3789 



age tat ctg ggg cca gat gtt aac caa get ctg gta ctg gag gag 
Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Leu Glu Glu 
1265 1270 1275 



3834 



aaa eta gee act gag gca gec atg gtc tea gag cct tct tgc cca 
Lys Leu Ala Thr Glu Ala Ala Met Val Ser Glu Pro Ser Cys Pro 
1280 1285 1290 



3879 



cag tac ccc ttc ccg get gat gca aac acc agt gac tat gtt aac 
Gin Tyr Pro Phe Pro Ala Asp Ala Asn Thr Ser Asp Tyr Val Asn 
1295 1300 1305 



3924 



tac ggc ttt aat cca gaa ttt ate cct gaa att aat get get age 
Tyr Gly Phe Asn Pro Glu Phe He Pro Glu He Asn Ala Ala Ser 
1310 1315 1320 



3969 



age tct ctg ccc aaa agt gac caa aag ttc taa 
Ser Ser Leu Pro Lys Ser Asp Gin Lys Phe 
1325 1330 



4002 



<210> 12 



<211> 1333 
<212> PRT 
<213> Mus sp. 

<400> 12 

Met Ala Ala Ala Trp Gin Gly Trp Leu Leu Trp Ala Leu Leu Leu Asn 
15 10 15 



Ser Ala Gin Gly Glu Leu Tyr Thr Pro Thr His Lys Ala Gly Phe Cys 
20 25 30 



Thr Phe Tyr Glu Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Gly Leu 
35 4 0 4 5 



Thr Ser Leu Ser Asn lie Ser Cys Leu Ser Asn Thr Pro Ala Arg His 
50 55 60 



Val Thr Gly Asp His Leu Ala Leu Leu Gin Arg Val Cys Pro Arg Leu 
65 70 75 80 



Tyr Asn Gly Pro Asn Asp Thr Tyr Ala Cys Cys Ser Thr Lys Gin Leu 
85 90 ' 95 



Val Ser Leu Asp Ser Ser Leu Ser lie Thr Lys Ala Leu Leu Thr Arg 
100 105 110 



Cys Pro Ala Cys Ser Glu Asn Phe Val Ser lie His Cys His Asn Thr 
115 120 125 



Cys Ser Pro Asp Gin Ser Leu Phe lie Asn Val Thr Arg Val Val Gin 
130 135 140 



Arg Asp Pro Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 



Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys Ser Arg Val Arg 
165 170 ' 175 



He Pro Ala Ala Ala Ser Leu Ala Val Gly Ser Met Cys Gly Val Tyr 
180 185 190 



Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 



195 



200 



205 



Thr Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu 
210 215 220 



Pro Gly Gin Ala Leu Ala Asp Gly Met Lys Pro Leu Asp Gly Lys He 
225 230 235 " 240 



Thr Pro Cys Asn Glu Ser Gin Gly Glu Asp Ser Ala Ala Cys Ser Cys 
245 250 ' 255 



Gin Asp Cys Ala Ala Ser Cys Pro Val He Pro Pro Pro Pro Ala Leu 
260 265 270 



Arg Pro Ser Phe Tyr Met Gly Arg Met Pro Gly Trp Leu Ala Leu He 
275 280 285 



He He Phe Thr Ala Val Phe Val Leu Leu Ser Val Val Leu Val Tyr 
290 295 300 



Leu Arg Val Ala Ser Asn Arg Asn Lys Asn Lys Thr Ala Gly Ser Gin 
305 310 315 320 



Glu Ala Pro Asn Leu Pro Arg Lys Arg Arg Phe Ser Pro His Thr Val 
325 330 335 



Leu Gly Arg Phe Phe Glu Ser Trp Gly Thr Arg Val Ala Ser Trp Pro 
340 345 350 



Leu Thr Val Leu Ala Leu Ser Phe He Val Val He Ala Leu Ser Val 
355 360 365 



Gly Leu Thr Phe He Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 



Ala Pro Lys Ser Gin Ala Arg Lys Glu Lys Ala Phe His Asp Glu His 
385 390 395 400 



Phe Gly Pro Phe Phe Arg Thr Asn Gin He Phe Val Thr Ala Lys Asn 
405 410 415 



Arg Ser Ser Tyr Lys Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe 
420 425 430 



Ser Gly lie Leu Ser Leu Asp Leu Leu Gin Glu Leu Leu Glu Leu Gin 
435 " 440 445 



Glu Arg Leu Arg His Leu Gin Val Trp Ser His Glu Ala Gin Arg Asn 
450 455 460 



lie Ser Leu Gin Asp lie Cys Tyr Ala Pro Leu Asn Pro His Asn Thr 
465 470 475 480 



Ser Leu Thr Asp Cys Cys Val Asn Ser Leu Leu Gin Tyr Phe Gin Asn 
485 490 495 



Asn His Thr Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Asn Gly Gin 
500 505 510 



Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 525 



Pro Leu Thr Tyr Lys Asp Gly Thr Ala Leu Ala Leu Ser Cys He Ala 
530 535 540 



Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala Val Gly Gly Tyr Gin 
545 550 555 560 



Gly Thr Asp Tyr Ser Glu Ala Glu Ala Leu He He Thr Phe Ser He 
565 570 575 



Asn Asn Tyr Pro Ala Asp Asp Pro Arg Met Ala His Ala Lys Leu Trp 
580 585 590 



Glu Glu Ala Phe Leu Lys Glu Met Gin Ser Phe Gin Arg Ser Thr Ala 
595 600 605 



Asp Lys Phe Gin He Ala Phe Ser Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 



He Asn Arg Thr Thr He Gin Asp Leu Pro Val Phe Ala He Ser Tyr 
625 630 635 640 



Leu lie Val Phe Leu Tyr He Ser Leu Ala Leu Gly Ser Tyr Ser Arg 
645 650 655 



Trp Ser Arg Val Ala Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly 
660 665 670 



Val Ala Val Val Leu Gly Ala Val Val Ala Ala Met Gly Phe Tyr Ser 
675 680 685 



Tyr Leu Gly Val Pro Ser Ser Leu Val He He Gin Val Val Pro Phe 
690 695 700 



Leu Val Leu Ala Val Gly Ala Asp Asn He Phe He Phe Val Leu Glu 
705 710 715 720 



Tyr Gin Arg Leu Pro Arg Met Pro Gly Glu Gin Arg Glu Ala His He 
725 730 735 



Gly Arg Thr Leu Gly Ser Val Ala Pro Ser Met Leu Leu Cys Ser Leu 
740 745 750 



Ser Glu Ala He Cys Phe Phe Leu Gly Ala Leu Thr Ser Met Pro Ala 
755 760 765 



Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala He He Phe Asp Phe 
770 775 780 



Leu Leu Gin Met Thr Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 800 



Arg Gin Glu Ala Ser Arg Pro Asp Val Val Cys Cys Phe Ser Ser Arg 
805 810 " 815 



Asn Leu Pro Pro Pro Lys Gin Lys Glu Gly Leu Leu Leu Cys Phe Phe 
820 825 830 



Arg Lys He Tyr Thr Pro Phe Leu Leu His Arg Phe He Arg Pro Val 
835 840 845 



Val Leu Leu Leu Phe Leu Val Leu Phe Gly Ala Asn Leu Tyr Leu Met 
850 855 860 



Cys Asn He Ser Val Gly Leu Asp Gin Asp Leu Ala Leu Pro Lys Asp 
865 870 875 880 



Ser Tyr Leu He Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Leu Glu Val 
885 890 * 895 



Gly Pro Pro Val Tyr Phe Asp Thr Thr Ser Gly Tyr Asn Phe Ser Thr 
900 905 910 



Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Glu Ser Phe 
915 920 925 



Ser Leu Thr Gin Lys He Gin Tyr Ala Ser Glu Phe Pro Asn Gin Ser 
930 935 940 



Tyr Val Ala He Ala Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
945 950 955 960 



Leu Thr Pro Ser Ser Ser Cys Cys Arg He Tyr Thr Arg Gly Pro His 
965 970 " 975 



Lys Asp Glu Phe Cys Pro Ser Thr Asp Thr Ser Phe Asn Cys Leu Lys 
980 985 990 



Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Thr Glu Gin 
995 1000 1005 



Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Thr Pro Asn He 
1010 1015 1020 



Arg Cys Pro Lys Gly Gly Leu Ala Ala Tyr Arg Thr Ser Val Asn 
1025 1030 1035 



Leu Ser Ser Asp Gly Gin He He Ala Ser Gin Phe Met Ala Tyr 
1040 1045 1050 



His Lys Pro Leu Arg Asn Ser Gin Asp Phe Thr Glu Ala Leu Arg 
1055 1060 1065 



Ala Ser Arg Leu Leu Ala Ala Asn He Thr Ala Glu Leu Arg Lys 
1070 1075 1080 



Val Pro Gly Thr Asp Pro Asn Phe Glu Val Phe Pro Tyr Thr He 
1085 1090 1095 



Ser Asn Val Phe Tyr Gin Gin Tyr Leu Thr Val Leu Pro Glu Gly 
1100 1105 1110 



He Phe Thr Leu Ala Leu Cys Phe Val Pro Thr Phe Val Val Cys 
1115 1120 1125 



Tyr Leu Leu Leu Gly Leu Asp He Arg Ser Gly He Leu Asn Leu 



1130 



1135 



1140 



Leu Ser He He Met He Leu Val Asp Thr He Gly Leu Met Ala 
1145 1150 1155 



Val Trp Gly He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val 
1160 1165 1170 



Thr Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg 
1175 1180 1185 



Ser Phe Ala Val Ser Thr Lys Pro Thr Arg Leu Glu Arg Ala Lys 
1190 1195 1200 



Asp Ala Thr He Phe Met Gly Ser Ala Val Phe Ala Gly Val Ala 
1205 1210 1215 



Met Thr Asn Phe Pro Gly He Leu He Leu Gly Phe Ala Gin Ala 
1220 1225 1230 



Gin Leu He Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr 
1235 1240 1245 



Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 



Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Leu Glu Glu 
1265 1270 1275 



Lys Leu Ala Thr Glu Ala Ala Met Val Ser Glu Pro Ser Cys Pro 
1280 1285 1290 



Gin Tyr Pro Phe Pro Ala Asp Ala Asn Thr Ser Asp Tyr Val Asn 
1295 1300 1305 



Tyr Gly Phe Asn Pro Glu Phe He Pro Glu He Asn Ala Ala Ser 
1310 1315 1320 



Ser Ser Leu Pro Lys Ser Asp Gin Lys Phe 
1325 1330 



<210> 13 



<211> 3999 



<212> DNA 

<213> Mus sp. 
<220> 

<221> misc_feature 

<222> (1)..(3999) 

<223> n is g or a or t or c 

<400> 13 



atggcngcng 


cntggcargg 


ntggytnytn 


tgggcnytny 


tnytnaayws 


ngcncarggn 


60 


garytntaya 


cnccnacnca 


yaargcnggn 


ttytgyacnt 


tytaygarga 


rtgyggnaar 


120 


aayccngary 


tnwsnggngg 


nytnacnwsn 


ytnwsnaaya 


thwsntgyyt 


nwsnaayacn 


180 


ccngcnmgnc 


aygtnacngg 


ngaycayytn 


gcnytnytnc 


armgngtntg 


yccnmgnytn 


240 


tayaayggnc 


cnaaygayac 


ntaygcntgy 


tgywsnacna 


arcarytngt 


nwsnytngay 


300 


wsnwsnytnw 


snathacnaa 


rgcnytnytn 


acnmgntgyc 


cngcntgyws 


ngaraaytty 


360 


gtnwsnathc 


aytgycayaa 


yacntgywsn 


ccngaycarw 


snytnttyat 


haaygtnacn 


420 


mgngtngtnc 


armgngaycc 


nggncarytn 


ccngcngtng 


tngcntayga 


rgcnttytay 


480 


carmgnwsnt 


tygcngaraa 


rgcntaygar 


wsntgywsnm 


gngtnmgnat 


hccngcngcn 


540 


gcnwsnytng 


cngtnggnws 


natgtgyggn 


gtntayggnw 


sngcnytntg 


yaaygcncar 


600 


mgntggytna 


ayttycargg 


ngayacnggn 


aayggnytng 


cnccnytnga 


yathacntty 


660 


cayytnytng 


arccnggnca 


rgcnytngcn 


gayggnatga 


arccnytnga 


yggnaarath 


720 


acnccntgya 


aygarwsnca 


rggngargay 


wsngcngcnt 


gywsntgyca 


rgaytgygcn 


780 


gcnwsntgyc 


cngtnathcc 


nccnccnccn 


gcnytnmgnc 


cnwsnttyta 


yatgggnmgn 


840 


atgccnggnt 


ggytngcnyt 


nathathath 


ttyacngcng 


tnttygtnyt 


nytnwsngtn 


900 


gtnytngtnt 


ayytnmgngt 


ngcnwsnaay 


mgnaayaara 


ayaaracngc 


nggnwsncar 


960 


gargcnccna 


ayytnccnmg 


naarmgnmgn 


ttywsnccnc 


ayacngtnyt 


nggnmgntty 


1020 


ttygarwsnt 


ggggnacnmg 


ngtngcnwsn 


tggccnytna 


cngtnytngc 


nytnwsntty 


1080 


athgtngtna 


thgcnytnws 


ngtnggnytn 


acnttyathg 


arytnacnac 


ngayccngtn 


1140 


garytntggw 


sngcnccnaa 


rwsncargcn 


mgnaargara 


argcnttyca 


ygaygarcay 


1200 


ttyggnccnt 


tyttymgnac 


naaycarath 


ttygtnacng 


cnaaraaymg 


nwsnwsntay 


1260 


aartaygayw 


snytnytnyt 


nggnccnaar 


aayttywsng 


gnathytnws 


nytngayytn 


1320 


ytncargary 


tnytngaryt 


ncargarmgn 


yt nmgncayy 


tncargtntg 


gwsncaygar 


1380 



gcncarmgna 


ayathwsnyt 


ncargayath 


tqytayqcnc 


cnytnaaycc 


ncayaayacn 


1440 


wsnytnacng 


aytgytqygt 


naaywsnytn 


ytncartayt 


tycaraayaa 


ycayacnytn 


1500 


ytnytnytna 


cngcnaayca 


racnytnaay 


ggncaracnw 


snytngtnga 


ytqqaarqay 


1560 


cayttyytnt 


aytgygcnaa 


ygcnccnytn 


acntayaarg 


ayqqnacnqc 


nytngcnytn 


1620 


wsntgyathg 


cngaytaygg 


ngcnccngtn 


ttyccnttyy 


tngcngtngg 


nggntaycar 


1680 


ggnacngayt 


aywsngargc 


ngargcnytn 


athathacnt 


tywsnathaa 


yaaytayccn 


1740 


qcnqayqayc 


cnmgnatggc 


ncaygcnaar 


vtntqqqarq 


argcnttyyt 


naargaratg 


1800 


carwsnttyc 


armgnwsnac 


ngcngayaar 


ttycarathg 


cnttywsngc 


ngarmgnwsn 


1860 


ytngargayg 


arathaaymg 


nacnacnath 


cargayytnc 


cngtnttygc 


nathwsntay 


1920 


ytnathgtnt 


tyytntayat 


hwsnytngcn 


ytnggnwsnt 


aywsnmgntg 


gwsnmgngtn 


1980 


gcngtngayw 


snaargcnac 


nytnggnytn 


ggnggngtng 


cngtngtnyt 


naonacnatn 


2040 


gtngcngcna 


tgggnttyta 


ywsntayytn 


ggngtnccnw 


snwsnytngt 


nathathcar 


2100 


gtngtnccnt 


tyytnqtnyt 


ngcngtnggn 


gcngayaaya 


thttyathtt 


ygtnyt ngar 


2160 


taycarmgny 


tnccnmgnat 


qccnqanqar 


carmgngarg 


cncayathgg 


nn\gn a cny t n 


2220 


ggnwsngtng 


cnccnwsnat 


qvtnvtntqv 


wsnytnwsng 


argcnathtg 


vtt vtt vvtn 


2280 


ggngcnytna 


cnwsnatgcc 


ngcngtnmgn 


acnttygcny 


tnacnwsngg 


nytngcnath 


2340 


athttygayt 


tyytnytnca 


ratgacngcn 


ttygtngcny 


tny tnwsnyt 


ngaywsnaair 


2400 


mgncargarg 


cnwsnmgncc 


ngaygtngtn 


tqvtqytt yw 


snwsnmgnaa 


yytnccnccn 


2460 


ccnaarcara 


argarggnyt 


nytnytntgy 


ttyttymgna 


arathtayac 


nccnt t yyt n 


2520 


ytncaymgnt 


tyathmgncc 


ngtngtnytn 


vtnvtntt vv 


tngtnytntt 


yggngcnaay 


2580 


ytntayytna 


tqtqyaayat 


hwsngtnggn 


vtnqa vcarq 


ayytngcnyt 


nccnaargay 


2640 


wsntayytna 


thgaytaytt 


vvtntt vvtn 


aaymgntayy 


tngargtngg 


nrrnrrnntn 


2700 


ta vtt vaava 


cnacnwsngg 


ntayaaytty 


wsnannaa rn 


cnrrrrna 1~ n?t pi 




71 fin 


wsnwsngcng 


gntgygarws 


nt" t" vwsnvt n 


□ vllv^Ql CJ CJ. J_ CI 




nuQnna r"i~ t" \/ 
iiwonydi i_i_y 


O \J 


ccnaaycarw 


sntaygtngc 


nathgcngcn 


wsnwsntggg 


tngaygaytt 


yathgaytgg 


2880 


ytnacnccnw 


snwsnwsntg 


ytgymgnath 


tayacnmgng 


gnccncayaa 


rgaygartty 


2940 


tgyccnwsna 


cngayacnws 


nttyaaytgy 


ytnaaraayt 


gyatgaaymg 


nacnytnggn 


3000 


ccngtnmgnc 


cnacnacnga 


rcarttycay 


aartayytnc 


cntggttyyt 


naaygayacn 


3060 


ccnaayathm 


gntgyccnaa 


rggnggnytn 


gcngcntaym 


gnacnwsngt 


naayytnwsn 


3120 



wsngayggnc 


arathathgc 


nwsncartty 


atggcntayc 


ayaarccnyt 


nmgnaaywsn 


3180 


cargayttya 


cngargcnyt 


nmgngcnwsn mgnytnytng 


cngcnaayat 


hacngcngar 


3240 


ytnmgnaarg 


tnccnggnac 


ngayccnaay 


ttygargtnt 


tyccntayac 


nathwsnaay 


3300 


gtnttytayc 


arcartayyt 


nacngtnytn 


ccngarggna 


thttyacnyt 


ngcnytntgy 


3360 


ttygtnccna 


cnttygtngt 


ntgytayytn 


ytnytnggny tngayathmg 


nwsnggnath 


3420 


ytnaayytny 


tnwsnathat 


hatgathytn 


gtngayacna 


thggnytnat 


ggcngtntgg 


3480 


ggnathwsnt 


ayaaygcngt 


nwsnytnath 


aayytngtna 


cngcngtngg 


natgwsngtn 


3540 


garttygtnw 


sncayathac 


nmgnwsntty 


gcngtnwsna 


cnaarccnac 


nmgnytngar 


3600 


mgngcnaarg 


aygcnacnat 


httyatgggn 


wsngcngtnt 


tygcnggngt 


ngcnatgacn 


3660 


aayttyccng 


gnathytnat 


hytnggntty gcncargcnc 


arytnathca 


rathttytty 


3720 


ttymgnytna 


ayytnytnat 


hacnytnytn 


ggnytnytnc 


ayggnytngt 


nttyytnccn 


3780 


gtngtnytnw 


sntayytngg 


nccngaygtn 


aaycargcny 


tngtnytnga 


rgaraarytn 


3840 


gcnacngarg 


cngcnatggt 


nwsngarccn 


wsntgyccnc 


artayccntt 


yccngcngay 


3900 


gcnaayacnw 


sngaytaygt 


naaytayggn 


ttyaayccng 


arttyathcc 


ngarathaay 


3960 


gcngcnwsnw 


snwsnytncc 


naarwsngay 


caraartty 
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<210> 14 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> primer 

<400> 14 

tcttcaccct tgctctttgc 20 

<210> 15 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 15 

aatgatggag agtaggttga ggat 24 

<210> 16 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 16 

tgcccacctt tgttgtctgc taccta 26 

<210> 17 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 17 

atcgctgaca ggatgcagaa g 21 

<210> 18 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 
<400> 18 



tcaggaggag caatgatctt ga 

<210> 19 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 19 

agattactgc cctggctcct agcaccatta 

<210> 20 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 20 

atcctcatcc tgggctttgc 

<210> 21 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 21 

gcaaggtgat caggaggttg a 



<210> 22 



<211> 
<212> 
<213> 



29 



DNA 



Artificial Sequence 



<220> 

<223> primer 

<400> 22 

cccagcttat ccagattttc ttcttccgc 

<210> 23 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 23 

tcttcaccct tgctctttgc 

<210> 24 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 24 

aatgatggag agtaggttga ggat 

<210> 25 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 25 

tgcccacctt tgttgtctgc tacc 24 

<210> 26 

<211> 23 

<212> DNA 

<213> Artificial Seguence 
<220> 

<223> primer 

<400> 26 

agcacctgtc cactgaagat ttc 23 

<210> 27 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 27 

tggacgctga gcttcagttc t 21 

<210> 28 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 



<223> primer 

<400> 28 

cttctctgcg ctgcctcgat ggaa 

<210> 29 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 29 

agtaaaaagg gctcgcagga t 

<210> 30 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 30 

ggcagctggt gacatcagag a 

<210> 31 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 31 

aggaggccat gcaggcctac tctga 



<210> 32 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 32 

gagtccacgg tcagtccatg t 

<210> 33 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 33 

ttatgaacaa caatgccaag caa 

<210> 34 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 34 

agtccttagg tagtggctta gtccctggaa gctc 



<210> 35 
<211> 52 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 

<400> 35 

gtaatacgac tcactatagg gccctgacgg tccttcctga gggaatcttc ac 52 

<210> 36 

<211> 50 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 

<400> 36 

gtaatacgac tcactatagg gcctgggaag ttggtcatgg ccactccagc 50 

<210> 37 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> FLAG tag 

<400> 37 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 

<210> 38 

<211> 4 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> motif 

<400> 38 

Tyr Gin Arg Leu 
1 

<210> 39 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> antigen 
<400> 39 

Glu Gin Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Pro Pro Asn 
15 10 15 

lie Arg Cys 

<210> 40 

<211> 16 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> antigen 

<400> 40 

Glu Ala Phe Tyr Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys 
1 5 10 15 



<210> 41 



<211> 
<212> 
<213> 



15 
PRT 

Artificial Sequence 



<220> 

<223> antigen 
<400> 41 

Gly Gin Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys 
15 10 15 

<210> 42 

<211> 17 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> antigen 
<400> 42 

Cys Ala Asn Ala Pro Leu Thr Phe Lys Asp Gly Thr Ala Leu Ala Leu 
15 10 15 

Ser 

<210> 43 

<211> 5092 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (57).. (4136) 

<223> 



<400> 43 

cttggctgtt cctgaggcct ggcctggctc cccgctgacc ccttcccaga cctggg atg 59 

Met 
1 

gcg gag gcc ggc ctg agg ggc tgg ctg ctg tgg gcc ctg etc ctg cgc 107 
Ala Glu Ala Gly Leu Arg Gly Trp Leu Leu Trp Ala Leu Leu Leu Arg 
5 10 15 

ttg gcc cag agt gag cct tac aca acc ate cac cag cct ggc tac tgc 155 
Leu Ala Gin Ser Glu Pro Tyr Thr Thr lie His Gin Pro Gly Tyr Cys 
20 25 30 

gcc ttc tat gac gaa tgt ggg aag aac cca gag ctg tct gga age etc 203 
Ala Phe Tyr Asp Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Ser Leu 
35 40 45 

atg aca etc tec aac gtg tec tgc ctg tec aac acg ccg gcc cgc aag 251 
Met Thr Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg Lys 
50 55 60 65 

ate aca ggt gat cac ctg ate eta tta cag aag ate tgc ccc cgc etc 299 
lie Thr Gly Asp His Leu lie Leu Leu Gin Lys lie Cys Pro Arg Leu 
70 75 " 80 

tac acc ggc ccc aac acc caa gcc tgc tgc tec gcc aag cag ctg gta 347 
Tyr Thr Gly Pro Asn Thr Gin Ala Cys Cys Ser Ala Lys Gin Leu Val 
85 90 95 

tea ctg gaa gcg agt ctg teg ate acc aag gcc etc etc acc cgc tgc 395 
Ser Leu Glu Ala Ser Leu Ser lie Thr Lys Ala Leu Leu Thr Arg Cys 
100 105 110 

cca gcc tgc tct gac aat ttt gtg aac ctg cac tgc cac aac acg tgc 443 
Pro Ala Cys Ser Asp Asn Phe Val Asn Leu His Cys His Asn Thr Cys 
115 120 125 

age ccc aat cag age etc ttc ate aat gtg acc cgc gtg gcc cag eta 4 91 

Ser Pro Asn Gin Ser Leu Phe lie Asn Val Thr Arg Val Ala Gin Leu 
130 135 140 145 

ggg get gga caa etc cca get gtg gtg gcc tat gag gcc ttc tac cag 539 
Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr Gin 
150 155 160 

cat age ttt gcc gag cag age tat gac tec tgc age cgt gtg cgc gtc 587 
His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg Val 
165 170 175 

cct gca get gcc acg ctg get gtg ggc acc atg tgt ggc gtg tat ggc 635 
Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr Gly 
180 185 190 

tct gcc ctt tgc aat gcc cag cgc tgg etc aac ttc cag gga gac aca 683 
Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp Thr 
195 200 205 



ggc aat ggt ctg gcc cca ctg gac ate acc ttc cac etc ttg gag cct 
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Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu Pro 
210 ' 215 220 225 



ggc cag gcc 
Gly Gin Ala 



cgt tgc aat 
Arg Cys Asn 



gac tgt get 
Asp Cys Ala 
260 

tec acc ttc 
Ser Thr Phe 
275 

ate etc tgc 
lie Leu Cys 
290 

cgt gtg gcc 
Arg Val Ala 



ggc acc age 
Gly Thr Ser 



ggc cag ttc 
Gly Gin Phe 
340 

acc ate ttg 
Thr lie Leu 
355 

ctg gtc ttt 
Leu Val Phe 
370 

ccc aac age 
Pro Asn Ser 



ggc ccc ttc 
Gly Pro Phe 



tec age tac 
Ser Ser Tyr 
420 

gga ate ctg 
Gly lie Leu 
435 



gtg ggg agt 
Val Gly Ser 
230 

gag tec caa 
Glu Ser Gin 
245 

gca tec tgt 
Ala Ser Cys 



tac ctg ggc 
Tyr Leu Gly 



tct gtc ttc 
Ser Val Phe 
295 

ccc gcc agg 
Pro Ala Arg 
310 

etc tct gac 
Leu Ser Asp 
325 

ttc cag ggc 
Phe Gin Gly 



gtg eta tct 
Val Leu Ser 



aca gaa etc 
Thr Glu Leu 
375 

caa gcc egg 
Gin Ala Arg 
390 

ttc cga acc 
Phe Arg Thr 
405 

agg tat gac 
Arg Tyr Asp 



gac ctg gac 
Asp Leu Asp 



ggg att cag 
Gly He Gin 



ggt gac gac 
Gly Asp Asp 
250 

cct gcc ata 
Pro Ala He 
265 

cag atg ccg 
Gin Met Pro 
280 

get gtg gtc 
Ala Val Val 



gac aaa age 
Asp Lys Ser 



aag etc age 
Lys Leu Ser 
330 

tgg ggc acg 
Trp Gly Thr 
345 

gtc ate ccg 
Val He Pro 
360 

act acg gac 
Thr Thr Asp 



agt gag aaa 
Ser Glu Lys 



aac cag gtg 
Asn Gin Val 
410 

tct ctg ctg 
Ser Leu Leu 
425 

ttg ctg ctg 
Leu Leu Leu 
440 



cct ctg aat 
Pro Leu Asn 
235 

gtg gcg acc 
Val Ala Thr 



gcc cgc ccc 
Ala Arg Pro 



ggc agt ctg 
Gly Ser Leu 
285 

acc ate ctg 
Thr He Leu 
300 

aag atg gtg 
Lys Met Val 
315 

ttc tec acc 
Phe Ser Thr 



tgg gtg get 
Trp Val Ala 



gtg gtg gcc 
Val Val Ala 
365 

ccc gtg gag 
Pro Val Glu 
380 

get ttc cat 
Ala Phe His 
395 

ate ctg acg 
He Leu Thr 



ctg ggg ccc 
Leu Gly Pro 



gag ctg eta 
Glu Leu Leu 
445 



gag ggg gtt 
Glu Gly Val 
240 

tgc tec tgc 
Cys Ser Cys 
255 

cag gcc etc 
Gin Ala Leu 
270 

gtc etc ate 
Val Leu He 



ctt gtg gga 
Leu Val Gly 



gac ccc aag 
Asp Pro Lys 
320 

cac acc etc 
His Thr Leu 
335 

teg tgg cct 
Ser Trp Pro 
350 

ttg gca gcg 
Leu Ala Ala 



ctg tgg teg 
Leu Trp Ser 



gac cag cat 
Asp Gin His 
400 

get cct aac 
Ala Pro Asn 
415 

aag aac ttc 
Lys Asn Phe 
430 

gag ctg cag 
Glu Leu Gin 



gca 779 
Ala 



caa 827 
Gin 



gac 875 
Asp 



ate 923 
He 



ttc 971 

Phe 

305 

aag 1019 
Lys 



ctt 1067 
Leu 



ctg 1115 
Leu 



ggc 1163 
Gly 



gcc 1211 

Ala 

385 

ttc 1259 
Phe 



egg 1307 
Arg 



age 1355 
Ser 



gag 1403 
Glu 



agg ctg egg 
Arg Leu Arg 
450 

tec ctg cag 
Ser Leu Gin 



etc tac gac 
Leu Tyr Asp 



cgc acg etc 
Arg Thr Leu 
500 

tec caa gtc 
Ser Gin Val 
515 

etc acc ttc 
Leu Thr Phe 
530 

tac ggg gee 
Tyr Gly Ala 



aag gac tat 
Lys Asp Tyr 



aat tac cct 
Asn Tyr Pro 
580 

gag gee ttc 
Glu Ala Phe 
595 

atg ttc cag 
Met Phe Gin 
610 

aat cgc acc 
Asn Arg Thr 



gtc ata ttc 
Val He Phe 



age cga gtg 
Ser Arg Val 
660 

gec gtg gtc 
Ala Val Val 
675 



cac etc cag 
His Leu Gin 
455 

gac ate tgc 
Asp He Cys 
470 

tgc tgc ate 
Cys Cys He 
485 

ctg ctg etc 
Leu Leu Leu 



gac tgg aag 
Asp Trp Lys 



aag gat ggc 
Lys Asp Gly 
535 

cct gtc ttc 
Pro Val Phe 
550 

tct gag gca 
Ser Glu Ala 
565 

gee ggg gac 
Ala Gly Asp 



tta gag gaa 
Leu Glu Glu 



gtc acg ttc 
Val Thr Phe 
615 

aca get gaa 
Thr Ala Glu 
630 

ctg tac ate 
Leu Tyr He 
645 

atg gtg gac 
Met Val Asp 



ctg gga gca 
Leu Gly Ala 



gta tgg teg 
Val Trp Ser 



tac gec ccc 
Tyr Ala Pro 



aac age etc 
Asn Ser Leu 
490 

aca gee aac 
Thr Ala Asn 
505 

gac cat ttt 
Asp His Phe 
520 

aca gec ctg 
Thr Ala Leu 



ccc ttc ctt 
Pro Phe Leu 



gag gec ctg 
Glu Ala Leu 
570 

ccc cgt ctg 
Pro Arg Leu 
585 

atg cga gec 
Met Arg Ala 
600 

atg get gag 
Met Ala Glu 



gac ctg ccc 
Asp Leu Pro 



tct ctg gee 
Ser Leu Ala 
650 

tec aag gec 
Ser Lys Ala 
665 

gtc atg get 
Val Met Ala 
680 



ccc gaa gca 
Pro Glu Ala 
4 60 

etc aat ccg 
Leu Asn Pro 
475 

ctg cag tat 
Leu Gin Tyr 



cag aca ctg 
Gin Thr Leu 



ctg tac tgt 
Leu Tyr Cys 
525 

gee ctg age 
Ala Leu Ser 
540 

gec att ggg 
Ala He Gly 
555 

ate atg acg 
He Met Thr 



gec cag gec 
Ala Gin Ala 



ttc cag cgt 
Phe Gin Arg 
605 

cgc tct ctg 
Arg Ser Leu 
620 

ate ttt gec 
He Phe Ala 
635 

ctg ggc age 
Leu Gly Ser 



acg ctg ggc 
Thr Leu Gly 



gec atg ggc 
Ala Met Gly 
685 



cag cgc aac 
Gin Arg Asn 



gac aat acc 
Asp Asn Thr 
480 

ttc cag aac 
Phe Gin Asn 
495 

atg ggg cag 
Met Gly Gin 
510 

gee aat gee 
Ala Asn Ala 



tgc atg get 
Cys Met Ala 



ggg tac aaa 
Gly Tyr Lys 
560 

ttc tec etc 
Phe Ser Leu 
575 

aag ctg tgg 
Lys Leu Trp 
590 

egg atg get 
Arg Met Ala 



gaa gac gag 
Glu Asp Glu 



acc age tac 
Thr Ser Tyr 
640 

tat tec age 
Tyr Ser Ser 
655 

etc ggc ggg 
Leu Gly Gly 
670 

ttc ttc tec 
Phe Phe Ser 



ate 1451 

He 

465 

agt 1499 
Ser 



aac 1547 
Asn 



acc 1595 
Thr 



ccg 1643 
Pro 



gac 1691 

Asp 

545 

gga 1739 
Gly 



aac 1787 
Asn 



gag 1835 
Glu 



ggc 1883 
Gly 



ate 1931 

He 

625 

att 1979 
He 



tgg 2027 
Trp 



gtg 2075 
Val 



tac 2123 
Tyr 



ttg ggt ate 
Leu Gly lie 
690 

gtg ctg tec 
Val Leu Ser 



cag agg ctg 
Gin Arg Leu 



cga gec eta 
Arg Ala Leu 
740 

gag gee ate 
Glu Ala He 
755 

egg acc ttt 
Arg Thr Phe 
770 

ctg cag atg 
Leu Gin Met 



cag gag gec 
Gin Glu Ala 



ctg ccc ccg 
Leu Pro Pro 
820 

aag get tat 
Lys Ala Tyr 
835 

ctg ctg ctg 
Leu Leu Leu 
850 

cac ate age 
His He Ser 



tac ctg ctt 
Tyr Leu Leu 



gec ccg gtg 
Ala Pro Val 
900 

get ggg atg 
Ala Gly Met 



cgc tec tec 
Arg Ser Ser 
695 

gtg ggg get 
Val Gly Ala 
710 

ccc egg agg 
Pro Arg Arg 
725 

ggc agg gtg 
Gly Arg Val 



tgc ttc ttc 
Cys Phe Phe 



gec ctg acc 
Ala Leu Thr 
775 

tea gee ttt 
Ser Ala Phe 
790 

tec egg ttg 
Ser Arg Leu 
805 

cct ggc cag 
Pro Gly Gin 



gee ccc ttc 
Ala Pro Phe 



ttt etc gee 
Phe Leu Ala 
855 

gtg gga ctg 
Val Gly Leu 
870 

gac tat ttc 
Asp Tyr Phe 
885 

tac ttt gtt 
Tyr Phe Val 



aat gee ate 
Asn Ala He 



ctg gtc ate 
Leu Val He 



gat aac ate 
Asp Asn He 



cct ggg gag 
Pro Gly Glu 
730 

get ccc age 
Ala Pro Ser 
745 

eta ggg gee 
Leu Gly Ala 
760 

tct ggc ctt 
Ser Gly Leu 



gtg gee ctg 
Val Ala Leu 



gac gtc tgc 
Asp Val Cys 
810 

gga gag ggg 
Gly Glu Gly 
825 

ctg ctg cac 
Leu Leu His 
840 

ctg ttc gga 
Leu Phe Gly 



gac cag gag 
Asp Gin Glu 



etc ttt ctg 
Leu Phe Leu 
890 

acc acc ttg 
Thr Thr Leu 
905 

tgc tec agt 
Cys Ser Ser 



ctg caa gtg 
Leu Gin Val 
700 

ttc ate ttt 
Phe He Phe 
715 

cca cga gag 
Pro Arg Glu 



atg ctg ttg 
Met Leu Leu 



ctg acc ccc 
Leu Thr Pro 
765 

gca gtg ate 
Ala Val He 
780 

etc tec ctg 
Leu Ser Leu 
795 

tgc tgt gtc 
Cys Cys Val 



etc ctg ctt 
Leu Leu Leu 



tgg ate act 
Trp He Thr 
845 

gtg age etc 
Val Ser Leu 
860 

ctg gec ctg 
Leu Ala Leu 
875 

aac cgc tac 
Asn Arg Tyr 



ggc tac aac 
Gly Tyr Asn 



gca ggc tgc 
Ala Gly Cys 



gtt cct ttc 
Val Pro Phe 



gtt etc gag 
Val Leu Glu 
720 

gtc cac att 
Val His He 
735 

tgc age etc 
Cys Ser Leu 
750 

atg cca get 
Met Pro Ala 



ctt gac ttc 
Leu Asp Phe 



gac age aag 
Asp Ser Lys 
800 

aag ccc cag 
Lys Pro Gin 
815 

ggc ttc ttc 
Gly Phe Phe 
830 

cga ggt gtt 
Arg Gly Val 



tac tec atg 
Tyr Ser Met 



ccc aag gac 
Pro Lys Asp 
880 

ttc gag gtg 
Phe Glu Val 
895 

ttc tec age 
Phe Ser Ser 
910 

aac aac ttc 
Asn Asn Phe 



ctg 2171 

Leu 

705 

tac 2219 
Tyr 



ggg 2267 
Gly 



tct 2315 
Ser 



gtg 2363 
Val 



etc 2411 

Leu 

785 

agg 2459 
Arg 



gag 2507 
Glu 



caa 2555 
Gin 



gtg 2603 
Val 



tgc 2651 

Cys 

865 

teg 2699 
Ser 



ggg 2747 
Gly 



gag 2795 
Glu 



tec 2843 
Ser 



915 



920 



925 



ttc acc cag aag ate cag tat gec aca gag ttc cct gag cag tct tac 
Phe Thr Gin Lys lie Gin Tyr Ala Thr Glu Phe Pro Glu Gin Ser Tyr 
930 935 940 945 

ctg gec ate cct gec tec tec tgg gtg gat gac ttc att gac tgg ctg 
Leu Ala lie Pro Ala Ser Ser Trp Val Asp Asp Phe lie Asp Trp Leu 
950 955 960 

acc ccg tec tec tgc tgc cgc ctt tat ata tct ggc ccc aat aag gac 
Thr Pro Ser Ser Cys Cys Arg Leu Tyr lie Ser Gly Pro Asn Lys Asp 
965 970 975 

aag ttc tgc ccc teg acc gtc aac tct ctg aac tgc eta aag aac tgc 
Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn Cys 
980 985 ' 990 

atg age ate acg atg ggc tct gtg agg ccc teg gtg gag cag ttc cat 
Met Ser He Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe His 
995 1000 1005 

aag tat ctt ccc tgg ttc ctg aac gac egg ccc aac ate aaa tgt 
Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn He Lys Cys 
1010 1015 1020 

ccc aaa ggc ggc ctg gca gca tac age acc tct gtg aac ttg act 
Pro Lys Gly Gly Leu Ala Ala Tyr Ser Thr Ser Val Asn Leu Thr 
1025 1030 1035 

tea gat ggc cag gtt tta gac aca gtt gec att ctg tea ccc agg 
Ser Asp Gly Gin Val Leu Asp Thr Val Ala He Leu Ser Pro Arg 
1040 1045 1050 

ctg gag tac agt ggc aca ate teg get cac tgc aac etc tac etc 
Leu Glu Tyr Ser Gly Thr He Ser Ala His Cys Asn Leu Tyr Leu 
1055 1060 1065 

ctg gat tea gec tec agg ttc atg gee tat cac aag ccc ctg aaa 
Leu Asp Ser Ala Ser Arg Phe Met Ala Tyr His Lys Pro Leu Lys 
1070 1075 1080 

aac tea cag gat tac aca gaa get ctg egg gca get cga gag ctg 
Asn Ser Gin Asp Tyr Thr Glu Ala Leu Arg Ala Ala Arg Glu Leu 
1085 1090 1095 

gca gec aac ate act get gac ctg egg aaa gtg cct gga aca gac 
Ala Ala Asn He Thr Ala Asp Leu Arg Lys Val Pro Gly Thr Asp 
1100 1105 1110 

ccg get ttt gag gtc ttc ccc tac acg ate acc aat gtg ttt tat 
Pro Ala Phe Glu Val Phe Pro Tyr Thr He Thr Asn Val Phe Tyr 
1115 1120 1125 



gag cag tac ctg acc ate 
Glu Gin Tyr Leu Thr He 
1130 1135 



etc cct gag ggg etc ttc atg etc age 
Leu Pro Glu Gly Leu Phe Met Leu Ser 
1140 



etc tgc ctt gtg ccc acc ttc get gtc tec tgc etc ctg ctg ggc 



Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys Leu Leu Leu Gly 
1145 ~ 1150 1155 



ctg gac ctg cgc tec ggc etc etc aac ctg etc tec att gtc atg 3578 
Leu Asp Leu Arg Ser Gly Leu Leu Asn Leu Leu Ser lie Val Met 
1160 1165 1170 

ate etc gtg gac act gtc ggc ttc atg gee ctg tgg ggc ate agt 3623 
He Leu Val Asp Thr Val Gly Phe Met Ala Leu Trp Gly He Ser 
1175 1180 1185 

tac aat get gtg tec etc ate aac ctg gtc teg gcg gtg ggc atg 3668 
Tyr Asn Ala Val Ser Leu He Asn Leu Val Ser Ala Val Gly Met 
1190 1195 1200 

tct gtg gag ttt gtg tec cac att acc cgc tec ttt gee ate age 3713 
Ser Val Glu Phe Val Ser His He Thr Arg Ser Phe Ala He Ser 
1205 1210 1215 

acc aag ccc acc tgg ctg gag agg gee aaa gag gec acc ate tct 3758 
Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu Ala Thr He Ser 
1220 1225 1230 

atg gga agt gcg gtg ttt gca ggt gtg gec atg acc aac ctg cct 3803 
Met Gly Ser Ala Val Phe Ala Gly Val Ala Met Thr Asn Leu Pro 
1235 1240 1245 

ggc ate ctt gtc ctg ggc etc gec aag gec cag etc att cag ate 3848 
Gly He Leu Val Leu Gly Leu Ala Lys Ala Gin Leu He Gin lie 
1250 1255 1260 

ttc ttc ttc cgc etc aac etc ctg ate act ctg ctg ggc ctg ctg 3893 
Phe Phe Phe Arg Leu Asn Leu Leu He Thr Leu Leu Gly Leu Leu 
1265 1270 1275 

cat ggc ttg gtc ttc ctg ccc gtc ate etc age tac gtg ggg cct 3938 
His Gly Leu Val Phe Leu Pro Val He Leu Ser Tyr Val Gly Pro 
1280 1285 1290 

gac gtt aac ccg get ctg gca ctg gag cag aag egg get gag gag 3983 
Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys Arg Ala Glu Glu 
1295 1300 1305 

gcg gtg gca gca gtc atg gtg gee tct tgc cca aat cac ccc tec 4028 
Ala Val Ala Ala Val Met Val Ala Ser Cys Pro Asn His Pro Ser 
1310 1315 1320 

cga gtc tec aca get gac aac ate tat gtc aac cac age ttt gaa 4073 
Arg Val Ser Thr Ala Asp Asn He Tyr Val Asn His Ser Phe Glu 
1325 1330 1335 

ggt tct ate aaa ggt get ggt gee ate age aac ttc ttg ccc aac 4118 
Gly Ser He Lys Gly Ala Gly Ala He Ser Asn Phe Leu Pro Asn 
1340 1345 1350 

aat ggg egg cag ttc tga tacagecaga ggccctgtct aggctctatg 4166 

Asn Gly Arg Gin Phe 

1355 



gccctgaacc 


aaagggttat 


ggggatcttc 


cttgtgactg 


ccccttgaca 


cacgccctcc 


4226 


tcaaatccta 


ggggaggcca 


ttcccatgag 


actgcctgtc 


actggaggat 


ggcctgctct 


4286 


tgaggtatcc 


aggcagcacc 


actgatggct 


cctctgctcc 


catagtgggt 


ccccagtttc 


4346 


caagtcacct 


aggccttggg 


cagtgcctcc 


tcctgggcct 


gggtctggaa 


gttggcagga 


4406 


acagacacac 


tccatgtttg 


tcccacactc 


actcactttc 


ctaggagccc 


acttctcatc 


4466 


caacttttcc 


cttctcagtt 


cctctctcga 


aagtcttaat 


tctgtgtcag 


taagtcttta 


4526 


acacgtagca 


gtgtccctga 


gaacacagac 


aatgaccact 


accctgggtg 


tgatatcaca 


4586 


ggaggccaga 


gagaggcaaa 


ggctcaggcc 


aagagccaac 


gctgtgggag 


gccggtcggc 


4646 


agccactccc 


tccagggcgc 


acctgcaggt 


ctgccatcca 


cggccttttc 


tggcaagaga 


4706 


agggcccagg 


aaggatgctc 


tcataaggcc 


caggaaggat 


gctctcataa 


gcaccttggt 


4766 


catggattag 


cccctcctgg 


aaaatggtgt 


tgggtttggt 


ctccagctcc 


aatacttatt 


4826 


aaggctgttg 


ctgccagtca 


aggccaccca 


ggagtctgaa 


ggctgggagc 


tcttggggct 


4886 


gggctggtcc 


tcccatcttc 


acctcgggcc 


tggatcccag 


gcctcaaacc 


agcccaaccc 


4946 


gagcttttgg 


acagctctcc 


agaagcatga 


actgcagtgg 


agatgaagat 


cctggctctg 


5006 


tgctgtgcac 


ataggtgttt 


aataaacatt 


tgttggcaga 


aaaaaaaaaa 


aaaaaaaaaa 


5066 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaa 








5092 
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Met Ala Glu Ala Gly Leu Arg Gly Trp Leu Leu Trp Ala Leu Leu Leu 
1 5 10 15 

Arg Leu Ala Gin Ser Glu Pro Tyr Thr Thr lie His Gin Pro Gly Tyr 
20 25 30 

Cys Ala Phe Tyr Asp Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Ser 
35 40 45 



Leu Met Thr Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg 
50 55 60 



Lys lie Thr Gly Asp His Leu lie Leu Leu Gin Lys lie Cys Pro Arg 
65 70 75 80 



Leu Tyr Thr Gly Pro Asn Thr Gin Ala Cys Cys Ser Ala Lys Gin Leu 
85 90 95 



Val Ser Leu Glu Ala Ser Leu Ser lie Thr Lys Ala Leu Leu Thr Arg 
100 105 110 



Cys Pro Ala Cys Ser Asp Asn Phe Val Asn Leu His Cys His Asn Thr 
115 120 125 



Cys Ser Pro Asn Gin Ser Leu Phe lie Asn Val Thr Arg Val Ala Gin 
130 135 140 



Leu Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 



Gin His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg 
165 170 175 



Val Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr 
180 185 190 



Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 



Thr Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu 
210 215 220 



Pro Gly Gin Ala Val Gly Ser Gly He Gin Pro Leu Asn Glu Gly Val 
225 230 235 240 



Ala Arg Cys Asn Glu Ser Gin Gly Asp Asp Val Ala Thr Cys Ser Cys 
245 250 255 



Gin Asp Cys Ala Ala Ser Cys Pro Ala He Ala Arg Pro Gin Ala Leu 
260 265 270 



Asp Ser Thr Phe Tyr Leu Gly Gin Met Pro Gly Ser Leu Val Leu He 
275 280 285 



He He Leu Cys Ser Val Phe Ala Val Val Thr He Leu Leu Val Gly 



290 



295 



300 



Phe Arg Val Ala Pro Ala Arg Asp Lys Ser Lys Met Val Asp Pro Lys 
305 310 315 320 



Lys Gly Thr Ser Leu Ser Asp Lys Leu Ser Phe Ser Thr His Thr Leu 
325 330 335 



Leu Gly Gin Phe Phe Gin Gly Trp Gly Thr Trp Val Ala Ser Trp Pro 
340 345 350 



Leu Thr He Leu Val Leu Ser Val He Pro Val Val Ala Leu Ala Ala 
355 360 365 



Gly Leu Val Phe Thr Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 * 380 



Ala Pro Asn Ser Gin Ala Arg Ser Glu Lys Ala Phe His Asp Gin His 
385 390 395 400 



Phe Gly Pro Phe Phe Arg Thr Asn Gin Val He Leu Thr Ala Pro Asn 
405 410 415 



Arg Ser Ser Tyr Arg Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe 
- 420 425 430 



Ser Gly He Leu Asp Leu Asp Leu Leu Leu Glu Leu Leu Glu Leu Gin 
435 * 440 445 



Glu Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Gin Arg Asn 
450 455 460 



He Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro Asp Asn Thr 
465 470 475 480 



Ser Leu Tyr Asp Cys Cys He Asn Ser Leu Leu Gin Tyr Phe Gin Asn 
485 490 495 



Asn Arg Thr Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Met Gly Gin 
500 505 510 



Thr Ser Gin Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 525 



Pro Leu Thr Phe Lys Asp Gly Thr Ala Leu Ala Leu Ser Cys Met Ala 
530 535 540 



Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala lie Gly Gly Tyr Lys 
545 ' 550 555 ~ 560 



Gly Lys Asp Tyr Ser Glu Ala Glu Ala Leu lie Met Thr Phe Ser Leu 
565 570 575 



Asn Asn Tyr Pro Ala Gly Asp Pro Arg Leu Ala Gin Ala Lys Leu Trp 
580 585 590 



Glu Glu Ala Phe Leu Glu Glu Met Arg Ala Phe Gin Arg Arg Met Ala 
595 600 605 



Gly Met Phe Gin Val Thr Phe Met Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 



lie Asn Arg Thr Thr Ala Glu Asp Leu Pro lie Phe Ala Thr Ser Tyr 
625 630 635 640 



lie Val lie Phe Leu Tyr lie Ser Leu Ala Leu Gly Ser Tyr Ser Ser 
645 650 655 



Trp Ser Arg Val Met Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly 
660 665 670 



Val Ala Val Val Leu Gly Ala Val Met Ala Ala Met Gly Phe Phe Ser 
675 680 685 



Tyr Leu Gly He Arg Ser Ser Leu Val He Leu Gin Val Val Pro Phe 
690 695 700 



Leu Val Leu Ser Val Gly Ala Asp Asn He Phe He Phe Val Leu Glu 
705 710 715 720 



Tyr Gin Arg Leu Pro Arg Arg Pro Gly Glu Pro Arg Glu Val His He 
725 730 735 



Gly Arg Ala Leu Gly Arg Val Ala Pro Ser Met Leu Leu Cys Ser Leu 
740 745 750 



Ser Glu Ala He Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala 
755 760 765 



Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala Val He Leu Asp Phe 
770 775 780 



Leu Leu Gin Met Ser Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 " 800 



Arg Gin Glu Ala Ser Arg Leu Asp Val Cys Cys Cys Val Lys Pro Gin 
805 810 815 



Glu Leu Pro Pro Pro Gly Gin Gly Glu Gly Leu Leu Leu Gly Phe Phe 
820 825 830 



Gin Lys Ala Tyr Ala Pro Phe Leu Leu His Trp He Thr Arg Gly Val 
835 840 845 



Val Leu Leu Leu Phe Leu Ala Leu Phe Gly Val Ser Leu Tyr Ser Met 
850 855 860 



Cys His He Ser Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
865 870 875 880 



Ser Tyr Leu Leu Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Phe Glu Val 
885 890 " " 895 



Gly Ala Pro Val Tyr Phe Val Thr Thr Leu Gly Tyr Asn Phe Ser Ser 
900 905 ~ 910 



Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Asn Asn Phe 
915 920 925 



Ser Phe Thr Gin Lys He Gin Tyr Ala Thr Glu Phe Pro Glu Gin Ser 
930 935 940 



Tyr Leu Ala He Pro Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
945 950 955 960 



Leu Thr Pro Ser Ser Cys Cys Arg Leu Tyr He Ser Gly Pro Asn Lys 
965 970 ' 975 



Asp Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn 
980 985 ~ 990 



Cys Met Ser He Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe 
995 1000 1005 



His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn lie Lys 
1010 1015 1020 



Cys Pro Lys Gly Gly Leu Ala Ala Tyr Ser Thr Ser Val Asn Leu 
1025 1030 1035 



Thr Ser Asp Gly Gin Val Leu Asp Thr Val Ala lie Leu Ser Pro 
1040 1045 1050 



Arg Leu Glu Tyr Ser Gly Thr lie Ser Ala His Cys Asn Leu Tyr 
1055 1060 1065 



Leu Leu Asp Ser Ala Ser Arg Phe Met Ala Tyr His Lys Pro Leu 
1070 1075 1080 



Lys Asn Ser Gin Asp Tyr Thr Glu Ala Leu Arg Ala Ala Arg Glu 
1085 1090 1095 



Leu Ala Ala Asn lie Thr Ala Asp Leu Arg Lys Val Pro Gly Thr 
1100 1105 1110 



Asp Pro Ala Phe Glu Val Phe Pro Tyr Thr lie Thr Asn Val Phe 
1115 1120 1125 



Tyr Glu Gin Tyr Leu Thr lie Leu Pro Glu Gly Leu Phe Met Leu 
1130 1135 ' 1140 



Ser Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys Leu Leu Leu 
1145 1150 1155 



Gly Leu Asp Leu Arg Ser Gly Leu Leu Asn Leu Leu Ser lie Val 
1160 1165 1170 



Met lie Leu Val Asp Thr Val Gly Phe Met Ala Leu Trp Gly He 
1175 1180 1185 



Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val Ser Ala Val Gly 
1190 1195 1200 



Met Ser Val Glu Phe Val Ser His He Thr Arg Ser Phe Ala He 
1205 1210 1215 



Ser Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu Ala Thr He 



1220 1225 1230 



Ser Met Gly Ser Ala Val Phe Ala Gly Val Ala Met Thr Asn Leu 
1235 1240 1245 



Pro Gly He Leu Val Leu Gly Leu Ala Lys Ala Gin Leu He Gin 
1250 1255 1260 



He Phe Phe Phe Arg Leu Asn Leu Leu He Thr Leu Leu Gly Leu 
1265 1270 1275 



Leu His Gly Leu Val Phe Leu Pro Val He Leu Ser Tyr Val Gly 
1280 1285 1290 



Pro Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys Arg Ala Glu 
1295 1300 1305 



Glu Ala Val Ala Ala Val Met Val Ala Ser Cys Pro Asn His Pro 
1310 1315 1320 



Ser Arg Val Ser Thr Ala Asp Asn He Tyr Val Asn His Ser Phe 
1325 1330 1335 



Glu Gly Ser He Lys Gly Ala Gly Ala He Ser Asn Phe Leu Pro 
1340 1345 1350 



Asn Asn Gly Arg Gin Phe 
1355 
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ggatcacttc 


ctggctctgg 


gatggcagct 


gcctggcagg 


gatggctgct 


ctgggccctg 


60 


ctcctgaatt 


cggcccaggg 


tgagctctac 


acacccactc 


acaaagctgg 


cttctgcacc 


120 


ttttatgaag 


agtgtgggaa 


gaacccagag 


ctttctggag 


gcctcacatc 


actatccaat 


180 


atctcctgct 


tgtctaatac 


cccagccccg 


ccatgtcaca 


ggtgaccacc 


tggctcttct 


240 


ccagcgcgtc 


tgtccccgcc 


tatacaatgg 


ccccaatgac 


acctatgcct 


gttgctctac 


300 



caagcagctg 


gtgtcattag 


acagtagcct 


gtctatcacc 


aaggccctcc 


ttacacgctg 


360 


cccggcatgc 


tctgaaaatt 


ttgtgagcat 


acactgtcat 


aatacctgca 


gccctgacca 


420 


gagcctcttc 


atcaatgtta 


ctcgcgtggt 


tcagcgggac 


cctggacagc 


ttcctgctgt 


480 


ggtggcctat 


gaggcctttt 


atcaacgcag 


ttttgcagag 


aaggcctatg 


agtcctgtag 


540 


ccgggtgcgc 


atccctgcag 


ctgcctcgct 


ggctgtgggc 


agcatgtgtg 


gagtgtatgg 


600 


ctctgccctc 


tgcaatgctc 


agcgcctggc 


tcaacttcca 


aggagacaca 


gggaatggcc 


660 


tggctccgct 


ggacatcacc 


ttccacctct 


tggagcctgg 


ccaggccctg 


gcagatggga 


720 


tgaagccact 


ggatgggaag 


atcaaaccct 


gcaatgagtc 


ccagggtgaa 


gactcggcag 


780 


cctgttcctg 


ccaggactgt 


gcagcatcct 


gccctgtcat 


ccctccgccc 


ccggccctgc 


840 


gcccttcttt 


ctacatgggt 


cgaatgccag 


gctggctggc 


tctcatcatc 


atcttcactg 


900 


ctgtctttgt 


attgctctct 


gttgtccttg 


tgtatctccg 


agtggcttcc 


aacaggaaca 


960 


agaacaagac 


agcaggctcc 


caggaagccc 


ccaacctccc 


tcgtaagcgc 


agattctcac 


1020 


ctcacactgt 


ccttggccgg 


ttcttcgaga 


gctggggaac 


aatggtggcc 


tcatggccac 


1080 


tcactgtctt 


ggcactgtcc 


ttcatagttg 


tgatagcctt 


gtcagtaggc 


ctgaccttta 


1140 


tagaactcac 


cacagaccct 


gtggaactgt 


ggtcggcccc 


taaaagccaa 


gcccggaaag 


1200 


aaaaggcttt 


ccatgacgag 


cattttggcc 


ccttcttccg 


aaccaaccag 


atttttgtga 


1260 


cagctaagaa 


caggtccagc 


tacaagtacg 


actccctgct 


gctagggccc 


aagaacttca 


1320 


gtgggatcct 


atccctggac 


ttgctgcagg 


agctgttgga 


gctacaggag 


agacttcgac 


1380 


acctgcaagt 


gtggtcccat 


gaggcacagc 


gcaacatctc 


cctccaggac 


atctgctatg 


1440 


ctcccctcaa 


accgcataac 


accagcctca 


ctgactgctg 


tgtcaacagc 


ctccttcaat 


1500 


acttccagaa 


caaccacaca 


ctcctgctgc 


tcacagccaa 


ccagactctg 


aatggccaga 


1560 


cctccctggt 


ggactggaag 


gaccatttcc 


tctactgtgc 


caatgcccct 


ctcacgtaca 


1620 


aagatggcac 


agccctggcc 


ctgagctgca 


tagctgacta 


cggggcgcct 


gtcttcccct 


1680 


tccttgctgt 


tgggggctac 


caagggacgg 


actactcgga 


ggcagaagcc 


ctgatcataa 


1740 


ccttctctat 


caataactac 


cccgctgatg 


atccccgcat 


ggcccacgcc 


aagctctggg 


1800 


aggaggcttt 


cttgaaggaa 


atgcaatcct 


tccagagaag 


cacagctgac 


aagttccaga 


1860 


ttgcgttctc 


agctgagcgt 


tctctggagg 


acgagatcaa 


tcgcactacc 


atccaggacc 


1920 


tgcctgtctt 


tgccatcagc 


taccttatcg 


tcttcctgta 


catctccctg 
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